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ABSTRACT 

We present a comprehensive space-based study of ten X-ray luminous galaxy clusters 
(Lx^8xlO'^''ergs"H0.1-2.4keV]) at z=0.2. Hubble Space Telescope (HST) observations 
reveal numerous gravitationally-lensed arcs for which we present four new spectroscopic red- 
shifts, bringing the total to thirteen confirmed arcs in this cluster sample. The confirmed arcs 
reside in just half of the clusters; we thus obtain a firm lower limit on the fraction of clusters 
with a central projected mass density exceeding the critical density required for strong-lensing 
of 50%. We combine the multiple-image systems with the weakly-sheared background galax- 
ies to model the total mass distribution in the cluster cores (i?^500kpc). These models are 
complemented by high-resolution X-ray data from Chandra and used to develop quantitative 
criteria to classify the clusters as relaxed or unrelaxed. Formally, (30±20)% of the clusters 
form a homogeneous sub-sample of relaxed clusters; the remaining (70±20)% are unrelaxed 
and are a much more diverse population. Most of the clusters therefore appear to be experienc- 
ing a cluster-cluster merger, or relaxing after such an event. We also study the normalization 
and scatter of scaling relations between cluster mass, luminosity and temperature. The scatter 
in these relations is dominated by the unrelaxed clusters and is typically cr~0.4. Most notably, 
we detect 2-3 times more scatter in the mass-temperature relation than theoretical simula- 
tions and models predict. The observed scatter is also asymmetric - the unrelaxed systems 
are systematically 40% hotter than the relaxed clusters at 2.5-cr significance. This structural 
segregation should be a major concern for experiments designed to constrain cosmological 
parameters using galaxy clusters. Overall our results are consistent with a scenario of cluster- 
cluster merger induced boosts to cluster X-ray luminosities and temperatures. 

Key words: cosmology: observations — gravitational lensing — clusters of galaxies: individ- 
ual: A68, A209, A267, A383, A773, A963, A 1763, A 1835, A2218, A2219 — galaxies: 
evolution 



1 INTRODUCTION 

Massive galaxy clusters are the largest collapsed structures in the 
Universe (Mviriai — lO^^M©), containing vast quantities of the pu- 
tative dark matter (DM), hot intracluster gas (fcTx— TkeV), and 
galaxies (ngai^lO''). These rare systems stand at the nodes of the 
"cosmic web" as defined by the large-scale filamentary structure 
seen in both galaxy redshift surveys (e.g. de Lapparent, Geller & 
Huchra 1986; Shectman et al. 1996; Vettolani et al. 1997; Peacock 



et al. 2001; Zehavi et al. 2002) and numerical simulations of struc- 
ture formation (e.g. Bond et al. 1996; Yoshida et al. 2001; Evrard 
et al. 2002). Clusters are inferred to assemble by accreting matter 
along the filamentary axes, slowly (tcrossing~2-3 Gyr) ingesting 
DM, gas and stars into their deep gravitational potential wells. 

Clusters have long been recognized as cosmological probes. 
For example, the evolution of cluster substructure with look-back- 
time is in principal a powerful diagnostic of the cosmological pa- 
rameters (Gunn & Gott 1972; Peebles 1980; Richstone, Loeb & 
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Turner 1992; Evrard et al. 1993). A complementary probe is to con- 
strain the matter density of the universe and the normalization of the 
matter power spectrum using the cluster mass function. However, 
it is currently not possible to measure the cluster mass function 
directly. More easily accessible surrogates such as the X-ray lumi- 
nosity and temperature functions are therefore used in combination 
with scaling relations between the relevant quantities (e.g. Eke et al. 
1996; Reiprich & Boehringer 2002; Viana, Nichol & Liddle 2002; 
Allen et al. 2003). A critical component of such analyses is the pre- 
cision to which the scaling relations are known. Samples of X-ray 
selected clusters are now of a sufficient size that systematic uncer- 
tainties may be comparable with the statistical uncertainties, and 
therefore deserve careful analysis before robust cosmological con- 
clusions may be drawn (e.g. Smith et al. 2003). Measurements of 
the Sunyaev-Zeldovich effect (SZE) are also emerging as a power- 
ful cosmological tool (Carlstrom, Holder Sl Reese 2002). Cosmo- 
logical SZE surveys will rely on the cluster mass-temperature rela- 
tionship in a similar manner to cosmological X-ray surveys. Such 
experiments may therefore also be compromised if astrophysical 
systematics are identified and carefully eliminated from the analy- 
sis (e.g. Majumdar & Mohr 2003). Detailed study of the assembly 
and relaxation histories of clusters, and their global scaling rela- 
tions as a function of redshift are therefore vitally important. 

To advance our understanding of the assembly, relaxation and 
thermodynamics of massive galaxy clusters requires information 
about the spatial distribution of DM, hot gas and galaxies in clus- 
ters. Several baryonic mass tracers are available, for example X-ray 
emission from the intracluster medium (hereafter ICM - e.g. Jones 
& Forman 1984; Buote & Tsai 1996; Schuecker et al. 2001) and 
the angular and line-of-sight velocity distribution of cluster galax- 
ies (e.g. Geller & Beers 1982; Dressier & Shectman 1988; West 
& Bothun 1990). These diagnostics have often been used as sur- 
rogates for a direct tracer of the underlying DM distribution. The 
major drawback of this approach is the requirement to assume a re- 
lationship between the luminous and dark matter distributions (e.g. 
that the ICM is in hydrostatic equilibrium with the DM potential) - 
it is precisely these assumptions that require detailed testing. This 
issue is further aggravated by the expectation that cluster mass dis- 
tributions are DM dominated on all but the smallest scales (e.g. 
Smith et al. 2001; Sand, Treu & Ellis 2002; Sand et al. 2004). 

Gravitational lensing offers a solution to much of this prob- 
lem, in that the lensing signal is sensitive to the total mass distribu- 
tion in the lens, regardless of its physical nature and state. Detailed 
study of gravitational lensing by massive clusters is therefore an 
important opportunity to gain an empirical understanding of the 
distribution of DM in clusters. Indeed, early comparisons between 
X-ray and lensing-based mass measurements revealed a factor 2-3 
discrepancy between X-ray and strong-lensing-based cluster mass 
estimates (e.g. Miralda-Escude & Babul 1995; Wu & Fang 1997), 
although the agreement between weak-lensing and X-ray measure- 
ments was generally better, albeit within large uncertainties (e.g. 
Squires et al. 1996, 1997; Small et al. 1997). The simplifying as- 
sumptions involved in the X-ray analysis were soon identified as 
the likely dominant source of this discrepancy; this was confirmed 
by several authors (e.g. Allen 1998; Wu et al. 1998; Wu 2000). In 
summary, X-ray and lensing mass measurements for the most re- 
laxed clusters agree well if the multi-phase nature of the ICM in 
cool cores (e.g. Allen et al. 2001) is incorporated into the X-ray 
analysis. The situation is more complex in more dynamically dis- 
turbed clusters, with larger discrepancies being found at smaller 
projected radii. The origin of the X-ray versus lensing mass dis- 
crepancy in clusters that do not contain a cool core is generally at- 



tributed to the simplifying equilibrium and symmetry assumptions 
of the X-ray analysis. For that reason, most modem X-ray clus- 
ter analyses that involve measuring cluster mass using only X-ray 
data understandably concentrate on relaxed, cool core clusters (e.g. 
Allen etal. 2001). 

An important caveat to adopting lensing as the tool of choice 
to measure cluster mass is that lensing actually constrains the pro- 
jected mass distribution along the line-of-sight to the cluster. The 
addition of three-dimensional information into lensing studies may 
therefore be important before final conclusions are drawn. For ex- 
ample Czoske et al.'s (2001; 2002) wide-field redshift survey of 
CI 0024+1654 at 2=0.395 revealed that this previously presumed 
relaxed strong-lensing cluster (e.g. Small et al. 1996; Tyson et al. 
1998) cluster is not relaxed, and appears to have suffered a recent 
merger along the line-of-sight (see also Kneib et al. 2003). 

Early gravitational lensing studies of galaxy clusters concen- 
trated on individual clusters selected because of their prominent 
arcs (e.g. Mellier et al. 1993; Kneib et al. 1994, 1995, 1996; Small 
et al. 1995a, 1996; Allen et al. 1996; Tyson et al. 1998). This 
"prominent arc" selection function was vital to development of the 
techniques required to interpret the gravitational lensing signal (e.g. 
Kneib 1993). However it also made it difficult to draw conclusions 
about galaxy clusters as a population of astrophysical systems from 
these studies. Small et al. (1997) used the Hubble Space Telescope 
(HST) to study a larger sample of optically rich clusters, selected 
originally for the purpose of studying the cluster galaxies. As X- 
ray selected samples became available in the late-1990's, Luppino 
et al. (1999) also searched for gravitational arcs in ground-based 
imaging of 38 X-ray luminous clusters. The broad conclusions to 
emerge from these pioneering studies were that to use gravitational 
lensing to learn about clusters as a population, a selection func- 
tion that mimics mass-selection as closely as possible, and the high 
spatial resolution available from HST imaging are both key require- 
ments. 

We are conducting an HST survey of an objectively selected 
sample of ten X-ray luminous (and thus massive) clusters at z~0.2 
(Table \\\ 32). Previous papers in this series have presented (i) 
a detailed analysis of the density profile of A 383 (Smith et al. 
2001), (ii) a search for gravitationally-lensed Extremely Red Ob- 
jects (EROs - Smith et al. 2002a) and (ill) near-infrared (NIR) 
spectroscopy of ERO J003707, a multiply-imaged FRO at z—1.6 
behind the foreground cluster A 68 (Smith et al. 2002b). This pa- 
per describes the gravitational lensing analysis of all ten clusters 
observed with HST and uses the resulting models of the cluster 
cores to measure the mass and structure of the clusters on scales 
of _R^500 kpc. We also exploit archival Chandra observations and 
NIR photometry of likely cluster galaxies to compare the distribu- 
tion of total mass in the clusters with the gaseous and stellar compo- 
nents respectively. This combination of strong-lensing. X-ray and 
NIR diagnostics enable us to quantify the prevalence of dynami- 
cal immaturity in the X-ray luminous population at 2~0.2 and to 
calibrate the high-mass end of the cluster mass-temperature rela- 
tionship. 

We outline the organization of the paper. In S|2|we describe the 
survey design and sample selection. We then describe the reduction 
and analysis of the optical data in ij3| comprising the HST imaging 
data ( i|3.1t and new spectroscopic redshift measurements for arcs 
in A 68 and A 2219 ( jT2l . The end-point of Jsjis a definition of the 
strong- and weak-lensing constraints available for all ten clusters. 
We apply these constraints in f|4| to construct detailed gravitational 
lens models of the cluster potential wells; the details of the mod- 
elling techniques are described in the Appendix, and the process of 
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Table 1. Summary of Hubble Space Telescope Observations 
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A 00 
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( .0 


A 209 


01 31 52.53 -13 36 40.5 


0.209 


15.2+1.0 


7.8 


A 267 


01 52 41.97 +01 00 26.2 


0.230 


11.1+0.9 


7.5 


A 383 


02 48 03.38 -03 31 45.7 


0.187 


9.8+0.3 


7.5 


A 773 


09 17 53.37 +51 43 37.2 


0.217 


12.5+2.1 


7.2 


A 963 


10 17 03.57 +39 02 49.2 


0.206 


13.4+1.0 


7.8 


A 1763 


13 35 20.10+41 00 04.0 


0.228 


14.2+2.1 


7.8 


A 1835 


14 01 02.05 +02 52 42.3 


0.253 


38.3+0.9 


7.5 


A2218 


16 35 49.22 +66 12 44.8 


0.171 


9.0+0.8 


6.5 


A2219 


16 40 19.82 +46 42 41.5 


0.228 


19.8+2.2 


14.4 




given in the [0.2-2.4 keV] pass- 


-band in units of 10'*'' erg s '. 



Luminosities are taken from XBACs catalog (Ebeling et al. 1996) un- 
less measurements based on pointed observations are available: A 383, 
Smith et al. (2001); A209, A267, A963, A 1835, Allen et al. (2003). 

fitting the constraints in each cluster are described in S|4| We then 
complement these gravitational lensing results with observations of 
the X-ray emission from the clusters' ICM, dra'wn from the Chan- 
dra data archive (ijs}- The main results of the paper are then pre- 
sented in f|6| including measurements of the mass and maturity of 
the clusters and a detailed study of the cluster scaling relations. We 
discuss the interpretation of the results in f0and briefly assess their 
impact on attempts to use clusters as cosmological probes. Finally, 
we summarize our conclusions in ^ 

We assume a spatially flat universe with 
i?o=50kms~^Mpc~^ and qQ—0.5; in this cosmology 
l"=4.2kpc at z=0.2. Our main results are insensitive to 
this choice of cosmology, for example, the cluster mass measure- 
ments would be modified by <10% if we adopted the currently 
favored values of SIm^O.S, f2A=0.7, /fo=65kms~'Mpc~'. 
We also adopt the complex deformation, T—Tx+iry = \T\e'^^^, 
as our measure of galaxy shape when dealing with the weak 
lensing aspects of our analysis, where T={a^ +b^) /2ab and 9 is 
the position angle of the major axis of the ellipse that describes 
each galaxy. We define the terms "ellipticity" to mean r and 
"orientation" to mean 9. All uncertainties are quoted at the 68% 
confidence level. 



2 SAMPLE SELECTION 

We aim to study massive galaxy clusters, and so would prefer to 
select clusters based on their mass. Mass-selected cluster catalogs 
extracted from ground-based observations are gradually becoming 
available (e.g. Miyazaki et al. 2002; Wittman et al. 2003), how- 
ever the blurring effect of the atmosphere make the completeness 
of these weak-lensing cluster catalogs very difficult to characterize 
robustly. These surveys are also unlikely to achieve the sky cov- 
erage (of order full-sky) required to detect a large sample of the 
rarest and most massive systems which are the focus of our pro- 
gram. In contrast. X-ray selected cluster catalogs (e.g. Gioia et al. 
1990; Ebeling et al. 1998, 2000; De Grandi et al. 1999) based on 
the ROSAT All-Sky Survey are already available in the public do- 
main with well-defined completeness limits. X-ray selection also 
influences the choice of survey epoch because the completeness of 
the X-ray cluster catalogs at the time that we applied for HST time 
in Cycle 8 (GO~8249) fell off rapidly beyond 2~0.3. We there- 
fore adopt z—0.2 as the nominal redshift of our cluster sample. 



This redshift is also well-suited to a lensing survey because the 
observer-lens, observer-source and lens-source angular diameter 
distances (Dol, i^os, -Dls) that control the power and efficiency 
of gravitational lenses render clusters at 2~0.2 powerful lenses for 
background galaxy populations at 2~0.7-1.5. This redshift interval 
is well-matched to the current generation of optical spectrographs 
on 10-m class telescopes. 

Accordingly, we select ten of the most X-ray luminous 
clusters (Lx^SxlO** ergss"\ 0.1-2.4 keV) in a narrow red- 
shift slice at 0.17^2^0.25, with minimal line-of-sight reddening 
(£(B-y)^0.1) from the XBACs sample (X-ray Brightest Abell- 
type Clusters; Ebeling et al. 1996). These clusters span the full 
range of X-ray properties (morphology, central galaxy line emis- 
sion, cooling flow rate, core radius) found in larger X-ray luminous 
samples (e.g. Peres et al. 1998; Crawford et al. 1999). The median 
X-ray luminosity of the sample is 13x lO^^erg s~^. We list the 
cluster sample in TableQ As XBACs is restricted to Abell clusters 
(Abell, Corwin, & Olowin 1989), the sample is not strictly X-ray 
selected. However, a comparison with the X-ray selected ROSAT 
Brightest Cluster Sample (BCS; Ebeling et al. 1998, 2000a) shows 
that 18 of the 19 BCS clusters that satisfy our selection criteria are 
either Abell or Zwicky clusters. This confirms that our sample is 
indistinguishable from a genuinely X-ray selected sample. 



3 OPTICAL DATA AND ANALYSIS 

3.1 HST Observations and Data Reduction 

All ten clusters were observed through the F702W filter using the 
WFPC2 camera on board HST^. The total exposure time for each 
cluster is listed in TableQ We adopted a three-point dither pattern 
for the eight clusters (A 68, A 209, A 267, A 383, A 773, A 963, 
A 1763, A 1835) observed in Cycle 8: each exposure was shifted 
relative to the previous one by ten WFC pixels (~1.0") in x and y. 
The archival observations of A 22 1 8 follow the same dither pattern, 
except the offsets were three WFC pixels in x and y. A 22 19 was 
observed with a six-point dither pattern that comprised two three- 
point dithers each of which were identical to that used for the Cy- 
cle 8 observations. These two dither patterns were offset from each 
other by 10 pixels in x and y. 

We measure the actual dither pattern and compare it with the 
commanded integer pixel offsets; the median difference between 
the commanded and actual offsets is 0.2 pixels, and generally lies 
in the range ~0- 0.4 pixels. The geometrical distortion at the edge 
of each chip (Gilmozzi et al. 1995; Holtzmann et al. 1995; Trauger 
et al. 1995; Casertano & Wiggs 2001) translates to an additional 
~0.2 pixel shift at the edge of each detector, falling to zero at the 
chip centers. Our observations therefore sub-sample the 0.1" WFC 
pixels at a level that varies spatially in the range ~0- 0.5 pixels. 
We therefore use the DITHER package (Fruchter & Hook 2002) to 
reduce the HST data because this allows us to correct for the ge- 
ometrical distortion and to recover a limited amount of spatial in- 
formation from the under-sampled WFPC2 point spread function 
(PSF). The final reduced frames (Fig.0 have a pixel scale of 0.05" 
and an effective resolution of FWHM=0.17". 



* Based on observations with the NASA/ESA Hubble Space Telescope ob- 
tained at the Space Telescope Science Institute, which is operated by the 
Association of Universities for Research in Astronomy, Inc., under NASA 
contract NAS 5-26555. 
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Figure 1. The HSTIWPPC2 frames of all ten clusters on a logarithmic scale and a false colour table. We overplot the weak shear field; these vectors show the 
mean elhpticity and orientation of faint background field galaxies on a grid that has been smoothed with a Gaussian of (t=10". The tickmarks are centered on 
the central galaxy (TablelTl and are separated by 10". 
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3.2 Identification and Confirmation of Multiple-image 
Candidates 

The primary reason for observing the cluster cores with HST is to 
take advantage of the exquisite spatial resolution of these data to 
identify multiply-imaged galaxies. Spectroscopic confirmation of 
such systems provides very tight constraints on the absolute mass of 
each cluster (roughly an order of magnitude better than is available 
from weak-lensing), and the spatial distribution of the cluster mass. 

We therefore begin the analysis by searching the HST frames 
for candidate multiple-image systems. This search combines vi- 
sual inspection of the data (looking for symmetric image pairs and 
tangentially or radially distorted arcs) with the SExtractor source 
catalogs described in il3.3l The effective surface brightness limit of 
the search in regions not affected by bright cluster galaxies is there- 
fore /i702— 25 mag arcsec"^ ( i|3.3> . To overcome the influence of 
bright cluster members, we generated unsharp-masked versions of 
the science frames, thus removing most of the flux from the bright 
galaxies. For example this exercise helped us to identify C19 un- 
der the brightest cluster galaxy (BCG) in A 68 (Fig.|2j. The resid- 
ual light from the bright galaxies in these unsharp-masked frames 
inevitably brightens the surface-brightness limit of the multiple- 
image search close to the cores (central few arcsec) of the sub- 
tracted galaxies. However, the only images that we expect to find 
in such locations are strongly de-amplified counter-images, the 
brighter images of which should be easily detectable elsewhere in 
the frame, if they lie above the surface brightness detection limit. 
We therefore expect the search for multiple-image systems to be 
reasonably complete to ^^702— 25 mag arcsec^^. 

We list the multiple-image candidates in Table |2| and mark 
them in Fig. |2| Faint sources detected in the cluster cores are ex- 
cluded from Table |2| if they are not plausibly multiply-imaged, 
based on morphological grounds, including examination of issues 
relating to the parity of possible counter images (see Smith 2002 
for a more detailed discussion of issues relating to the identification 
of multiple-image candidates). The clusters are sub-divided in Ta- 
ble |2| into those with spectroscopically confirmed multiple-image 
systems (top) and those for which no spectroscopic identifications 
of genuine multiple-image systems is yet available (bottom). Each 
sub-set of clusters contains half of the total sample of 10. Of the 
five clusters without any spectroscopically confirmed multiples, 
two contain convincing strong-lensing candidates: A 267 (E2) and 
A 1835 (K3). A firm lower limit on the fraction of clusters in this 
sample that contain a core region with a projected mass density 
above the critical density required for strong-lensing is therefore 
50%, although values as high as ~70-80% are also plausible. 

Table|2|also lists the spectroscopic redshifts that are available 
from other articles in this series (Smith et al. 2001, 2002b), and the 
published literature. We refer the interested reader to these articles 
for the details of the spectroscopy and multiple-image interpreta- 
tion. Note that some of the previously published multiple-image 
identifications were based on ground-based data, and therefore suf- 
fered from quite severe uncertainties. We discuss in ^4] improve- 
ments to the interpretation of the data that are now possible using 
the HST data presented here. We also present below new spectro- 
scopic identifications of four multiple-image candidates, recently 
obtained with the Keck and Subaru telescopes. 

3.2.1 Keck-I/LRIS Observations of A 68 

On November 30, 2002, we conducted deep multi-slit spectroscopy 
with the Low Resolution Imager Spectrograph (LRIS; Oke et al 
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Figure 3. One-dimensional .spectrum of C4 in A 68, obtained with LRIS. 
We interpret the single strong emission line as Ly-a at 2=2.625 (see i|3.2l 
for more details). 

1995) on the Keck-I telescope®, on the cluster A 68. The night had 
reasonable seeing, ~0.8", but was not photometric (with some cir- 
rus), thus no spectrophotometric standard stars were observed. A 68 
was observed for a total of 7.2 ks using the D680 dichroic with the 
600/7500 grating on the red side and the 400/3400 grism on the 
blue side. On the red side the spectral dispersion was 1.28A pixel" ^ 
with a spatial resolution of 0.214" pixeP^, and on the blue side, the 
spectral dispersion was 1.09A pixeP^ with a spatial resolution of 
0.135" pixen^ using the blue sensitive 2kx4k Marconi CCDs. 

Three multiple-image candidates were targetted in the mask: 
COab, Clc and C4 (Table |2}. Only C4 has a strong spectral fea- 
ture - a single strong emission line at Aobs=4404.7 A (Fig.|3}. We 
interpret this line as Ly-a A1216A at 2=2.625. The only other 
possibility would be [Oil] at 2=0.18, i.e. in front of the cluster. 
We consider this the less likely option given the apparent tangen- 
tial distortion of the arclet with respect to the cluster center, and 
the presence of C3 and C20 which appear to be lensed galaxies at 
a similar redshift to C4 (Fig.|2}. We note however, that it appears 
these three arclets are each single images of different background 
galaxies. 

3.2.2 Subaru/FOCAS Observations of A 2219 

On May 29-30, 2001, we conducted deep multi-slit spectroscopy 
of A2219 with the Faint Object Camera And Spectrograph (FO- 
CAS; Kashikawa et al 2002) on the Subaru 8.3-m telescope^'^. 
The two nights had reasonable seeing, FWHM~0.8", but were 

® The W.M. Keck Observatory is operated as a scientific partnership among 
the California Institute of Technology, the University of California, and 
NASA. 

^" Based on data collected at the Subaru Telescope, which is operated by 
the National Astronomical Observatory of Japan. 
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Table 2. Multiple-image Candidates and Spectroscopic Redshifts 



Cluster 


Candidate 


Redshift" 


References 


Notes 
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Nl2 




PI 






Disk galaxy; edge of disk is counter image of PO 


N3 




P2a/b/c 


2.730 


13.14.jl2l 




L123 




P3/P4 


3.666 


14.ilX2l 




A, C 




P5 




14.jl2l 


Counter image of P3/P4 


B 




P6/P7/P8 


2.5±0.2 




Faint pair (P6/P7) plus counter-image (P8) 






P9/P10 


1.3±0.2 




Candidate pair adjacent to PO 






P11/P12 


1.5±0.3 




Faint pair 




Clusters with only Candidate Multiple-images [5/101 


A 209 


DO 






Faint arclet - singly imaged? 






Dl 






Asymmetric morphology - singly imaged? 






D2 






Disturbed morphology - singly imaged? 




A 267 


El 


0.23 


15 


Cluster member 






E2a/b 






Faint image pair; counter-image not detected. 




A 773 


FO 


0.650 


3 


Singly-imaged? 






F13 


0.398 


3 


Singly-imaged? 






F18 


0.487 


3 


Singly-imaged? 






F19 


0.425 


3 


Singly-imaged? 




A 1763 


J4 




1 


Singly imaged or merging pair? 


EROJ 133521+4100.4 


A 1835 


KO 




16 


Radial feature - associated with BCG? 


A 1835-B' 




Kl 




16 


High surface brightness arclet - singly-imaged? 


A 1835-B 




K2 






Faint arclet - singly imaged? 






K3 




16 


Low surface brightness blue arcs 


A1835-A 



" Redshift stated with an eiTor bar are inferred from the lens model of the relevant cluster. 
' See Smith et al. (2001) for a full hst of candidate multiples in A 383. 

See Kneib et al. (1996; 2004, in prep.) for a full list of candidate multiples in A2218. 
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Figure 2. Zooms into the central region of seven of the clusters showing the confirmed and candidate multiple-image systems discussed in the text. The black 
curves follow the 2=1.60, 5.4, 2=1.01 and 2=0.771 tangential critical curves for A68, A383 and A963 respectively as computed from the lens models. 
The bar at the bottom left of each panel shows a physical scale of 50kpc. The orientation of each panel matches the corresponding panel in Fig.0 tick marks 
are spaced at 5" intervals. 
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Figure 2. [continued] Zooms into tlie central region of A 773, A 221 8 and A 2219, showing the confirmed and candidate multiple-image systems discussed 
in the text. The curves follow the 2=0. 702 and 2=1.069, 3.666 tangential critical curves for A2218 and A2219 respectively, as computed from the lens 
models. The bar at the bottom left of each panel shows a physical scale of 50kpc. The orientation of each panel matches the corresponding panel in Fig.lTI 
tick marks are spaced at 5" intervals. 
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\esl (/^"l) [z= 1-069] A,est (Mm) [z = 2.730] A^^^, (/im) [z = 3.6B6] 




Figure 4. One-dimensional spectra of PO (left), P2c (center) and P3 (right) multiple-image systems in A 2219, obtained with the FOCAS spectrograph on 
Subaru. The vertical dotted lines in each panel mark the features used to identify the redshift of each system; the details of these identifications is discussed in 

£21 



not fully photometric (with some cirrus), nevertheless we obtained 
a crude flux calibration using the spectrophotometric standard star 
"Wolf 1346". 

We observed A 2219 for a total of 12.6 ks using the Medium 
Blue (300B/mm) grism and the order sorting filter Y47. We used 
the MIT 2kx4k CCD detector with a binning factor of 3 in x and 
2 in y, this gives us a spectral dispersion of 2.8A pixeP^ and a 
spatial resolution of 0.3" pixel" ^. Four multiple image candidates 
were targetted in the mask: PO and P2c had previously been identi- 
fied by Small et al. (1995) as gravitational arcs, and P3/P4 had been 
identified by Bezecourt et al. (2000) as lying at 2:=3.6±0.4 using 
photometric redshift techniques. We list the results of our observa- 
tions (see also Figure|4}: 

(i) PO is identified as a 2:=1.069±0.001 star-forming galaxy 
showing a strong [Oil] A3727A emission plus weak Balmer 
and Calcium lines. 

(ii) P2c is identified as a 2:=2.730±0.001 galaxy using the fol- 
lowing interstellar metal absorption lines: OI A1302.17A, 
SilV A1393.7, 1397.0A, Fell A1608.45A, CI A1656.93A, and 
AlII A1670.79A. 

(iii) P3 is identified as a 2;=3.666±0.001 galaxy using a broad 
Ly-a absorption feature and the metal absorption lines 01 
AI302.17A, SilV A1393.7, 1397.0A, plus CIV A1548.2, 
1550.77A in emission with a broad absorption feature on the 
blue side. 

(iv) P4 was also observed, although, it appears that the slit was not 
well aligned with the target galaxy, possibly due to a mask- 
milling problem. We do however detect an absorption feature 
in these data at the same wavelength as the Ly-a absorption 
feature in P3. It therefore appears that P4 is also at 2=3.666. 

We interpret PO as a pair of merging images straddling the 
2=1.07 critical line. Small et al. (1995) proposed that PI is the 
counter-image of this pair, however Bezecourt et al. (2000) ar- 
gued against PI because its optical/near-infrared colours are redder 
than those of PO. When constraining the lens model of this cluster 
with just this multiple-image system, several alternative counter- 
images provided plausible fits. However when this constraint was 
combined with other multiple-image systems, especially P3/P4/P5 
which also lies in the saddle region between the BCG and the group 



of galaxies to the South-West, an acceptable model was only pos- 
sible if PI is identified as the counter-image of PO. The contradic- 
tion between this result and Bezecourt et al.'s (2000) photometry is 
eliminated by the superb spatial resolution of the HST data, because 
it reveals that PI is a disk galaxy, the Southern portion (presumably 
part of the disk) of which has a surface-brightness consistent with 
that of PO. This is confirmed by inspection of a colour image of this 
field based on Czoske's (2002) B J?7-band CFHI2k imaging of this 
cluster, which reveals that the Southern envelope of PI is also bluer 
than the central region, and is consistent with this interpretation. 

The spectroscopic identifications of P3 and P4 confirm 
Bezecourt et al.'s (2000) results. We identify P5 as the third image 
of this galaxy. 

3.3 Source Extraction and Analysis 

In addition to the multiple-image constraints described in the pre- 
vious section, we also need to construct catalogs of cluster galaxies 
and faint, weakly lensed galaxies. The former are incorporated into 
the gravitational lens models (S|4j as galaxy-scale perturbations to 
the overall cluster potential. The latter supplement the multiple- 
image systems to further constrain the parameters of the lens mod- 
els. 

The first step toward the cluster and background galaxy cat- 
alogs is to analyze the HST frames using SExtractor (Berlin & 
Arnouts 1996). We selected all objects with isophotal areas in ex- 
cess of 7 pixels (0.017 arcsec*^) at the /i702 =25.2 mag arcsec"^ 
isophote (1.5-a"/pixel). All detections centroided within 3" of the 
edge of the field of view, and within regions affected by diffrac- 
tion spikes associated with bright stars are discarded, leaving a to- 
tal of 8,730 "good" detections, of which 193 are classified as stars. 
We estimate from the roll-over in the number counts at faint lim- 
its, and Monte Carlo simulations of our ability to recover artificial 
faint test sources with SExtractor, that the 80% completeness limit 
of the HST frames is i?702 — 26. We also use a simple model that 
combines the behaviour of deep J?-band field galaxy counts (e.g. 
Smail et al. 1995b; Hogg et al. 1997) with a composite cluster lu- 
minosity function with a faint end slope of a=l (e.g. Adami et al. 
2000; Goto et al. 2002; De Propris et al. 2003) to determine at what 
magnitude limit to divide the source catalogs into bright and faint 
sub-samples. We adopt i?702=22, which corresponds to 3.5 mags 
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r(arcsec) Area (pixels) 

Figure 5. left — Mean tangential shear, Tu^raw, of the high signal-to-noise star sample as a function of radial distance from the stacked chip center. 
The dashed line shows the polynomial fit to these data that we used to remove the PSF anisotropy. RIGHT — The results of Monte Carlo simulations to test 
how accurately we can measure the ellipticity of faint galaxies. The measured ellipticity as a fraction of the true elhpticity is well-fitted by a second-order 
polynomial function (dashed curve). When the galaxy image is smaller than ~30 pixels, the shape measurements are overwhelmed by the PSF and the effects 
of pixelization. We therefore cut our faint galaxy catalogs at a "faint" limit of area^SO pixels. We also use the polynomial function plotted in this figure to 
connect the observed ellipticities to intrinsic ellipticities. 



fainter than an L* galaxy at the cluster redshift. We estimate con- 
servatively that 20% of the sources fainter than this hmit may be 
cluster galaxies, thus contaminating the sample used for the weak- 
lensing constraints. In H4.2l we verify that this contamination has a 
negligible effect on our results. 



3.3.1 Cluster Galaxies 

The mass of the galaxy-scale mass components in the lens models 
generally scale with their luminosity (see Appendix for details). We 
therefore apply two corrections to the i?702-band MAG_BEST mag- 
nitudes of the bright galaxies ( i|3.3> to obtain robust measurements 
of the luminosities of the cluster galaxies. 

Balogh et al. (2002) fitted parametrized bulge and disk surface 
brightness profiles using GIM2d (Simard 1998) to the cluster galax- 
ies in this sample. We compare the SExtractor MAG_BEST values in 
our bright galaxy catalogs ( il3.3> with Balogh et al.'s surface pho- 
tometry of the same galaxies, finding that MAG_BEST is fainter than 
the corresponding GIM2d magnitude. Typically Ai?702~0. 1-0.2, 
increasing to Ai?702~0.5-1.5 for the brightest cluster members 
including the BCGs. These differences arise because SExtractor 
over-estimates the sky background for the brighter and thus larger 
cluster galaxies, because as the size of these galaxies approaches 
the mesh size used for constructing the local background map, 
source flux is absorbed into the background. A second problem oc- 
curs in crowded cluster cores. When a smaller galaxy is de-blended 
from a brighter galaxy, SExtractor often incorrectly associates pix- 
els from the brighter galaxy with the fainter, thus over-estimating 
the flux from fainter and under-estimating the flux from brighter 
galaxies. We therefore adopt Balogh et al.'s surface photometry as 
the total _R702-band magnitudes of the cluster galaxies. 

Optical photometry is more sensitive to ongoing star forma- 
tion in cluster galaxies than NIR photometry. To gain a more re- 
liable measure of stellar mass in the cluster galaxies we therefore 



exploit K-hw\d imaging of the cluster fields, obtained as part of 
our search for gravitationally-lensed EROs (Smith et al. 2002a), 
to convert the total i?702-band magnitudes to total K-hw\A magni- 
tudes. We subtract the 2" aperture {R702—K) colours of the cluster 
galaxies measured by Smith et al. (2002a) from the total 7?702-band 
magnitudes to obtain total A"-band magnitudes. Finally, we con- 
vert these magnitudes to rest-frame K-band luminosities, adopting 
Af^=(-23.38±0.03)-h51og h (Cole et al. 2001) and Mk0=3.39 
(Johnson 1966; Allen 1973) and estimating /^'-corrections from 
Mannucci et al. (2001). Summing in quadrature all of the uncer- 
tainties arising from these conversions, we estimate that the K- 
band luminosity of an L* galaxy is good to 10%. 



3.3.2 Faint Galaxies 

In this section we develop and apply several corrections to recover 
robust shape measurements of the faint galaxies for use as weak- 
lensing constraints on the cluster mass distributions. The goal of 
these corrections is to remove any artificial enhancement or sup- 
pression of image ellipticities from the faint galaxy catalogs. Such 
effects arise from the geometry of the focal plane, isotropic and 
anisotropic components of the PSF and pixelization of faint galaxy 
images. A correction for the geometric distortion of the focal plane 
was applied in the data reduction pipeline using Trauger et al.'s 
(1995) polynomial solution ( i|3.H . We deal with the remaining is- 
sues in turn below. 

The HSTfWFPCl PSF varies spatially and temporally at the 
~10% and ~2% levels respectively (Hoekstra et al. 1998; Rhodes 
et al. 2000). We ignore the temporal component because, as we 
demonstrate below using simulations, point source ellipticities of 
~2% are comparable with the noise on the shape measurements. 
We examine the spatial variation of the PSF in the ten WFPC2 
frames, however each field contains just ~6 suitable isolated, high 
signal-to-noise, unsaturated stars. We therefore exploit archival 



© 0000 RAS, MNRAS 000, 000-000 



HSTfWFPCl observations of a further eight low luminosity clus- 
ters (Lx^lO**[0.1-2.4keV]ergs-^) at z~0.25 (C10818+56, 
CI 0819+70, CI 0841+70, CI 0849+37, CI 1309+32, CI 1444+63, 
CI 1701+64 and CI 1702+64) that were observed in an identical 
manner to our Cycle 8 observations (Balogh et al. 2002). These data 
were processed identically to the primary science data and strin- 
gent selection criteria applied to the combined dataset to construct 
a sample of 103 stars from which to derive a PSF correction. 

The PSFs of these stars are tangentially distorted with respect 
to the center of each WFC chip with the magnitude of the distortion 
increasing with distance from each chip center. The tangential shear 
at the edge of each chip is '-^5-10%, falling to <l-2% at each chip 
centre, and the variation in distortion pattern between the chips is 
negligible, in agreement with Hoekstra et al. (1998) and Rhodes et 
al. (2000). We therefore stack the three chips to derive a global so- 
lution by fitting a second order polynomial to the tangential shear as 
a function of distance from the chip center (Fig.|5}. After applying 
this correction to the entire star sample from all 18 cluster fields, 
the median tangential shear of the stars is reduced to the same level 
as the radial stellar shear i.e. <l-2%. We interpret the residuals 
as random noise, and test this hypothesis using Monte Carlo sim- 
ulations. We insert 10^ stellar profiles that are sheared in small in- 
crements between zero and 10% into random blank-sky positions 
in our science frames. We then run the same SExtractor detection 
algorithm as described in il3.3l on these frames. These simulations 
reveal that the position angle of a nearly circular stellar source can 
only be measured to <10% accuracy if the ellipticity of the source 
is ^2.5%, thus confirming our interpretation of the residuals. We 
use the results of this analysis to correct the shape measurements 
in the faint galaxy catalogs. The corrections all result in a change 
of <,0-02 in the final weak shear constraints listed in Table|3|- i.e. 
smaller than or comparable with the statistical uncertainties. 

We also use Monte Carlo simulations to estimate the minimum 
number of contiguous pixels required for a reliable shape measure- 
ment (<10% uncertainty). We first select a sample of relatively 
bright (_R702~19-21) background galaxies to use as test objects, 
ensuring that these galaxies cover the observed range of elliptici- 
ties in deep field galaxy surveys (e.g. Ebbels 1998). We scale and 
insert these test objects into random blank-sky positions in the sci- 
ence frames and attempt to detect them by running SExtractor in 
the same configuration as used in i|3.3l We perform ~10'' realiza- 
tions spanning the full range of expected apparent magnitudes and 
scale sizes of faint galaxies (e.g. Small et al. 1995b). The mea- 
sured ellipticity declines markedly as a fraction of the input ellip- 
ticity for sources with areas smaller than '-^l, 000 pixels. Also, the 
smallest galaxy area for which the uncertainty in its shape mea- 
surement is <10% is ~30 pixels. This limit represents, for the ex- 
pected ellipticity distribution (0^r<1.5 - Ebbels 1998), the min- 
imum galaxy size for which both the minor and major axes are 
resolved by HSTfWFFCl. We therefore adopt 30 contiguous pix- 
els as the "faint" limit of our background galaxy catalogs. We also 
fit a second order polynomial to the simulation results in the range 
SO^area^lO^ pixels (Fig.|5}, and use this recovery function to 
correct the measured ellipticity of each source in the faint galaxy 
catalogues. 

3.4 Summary of Lens Model Constraints 

In this section we briefly re-cap the strong-lensing constraints and 
then describe how the faint galaxy catalogs constructed in i|3.3.2l 
are converted into constraints on the cluster mass distributions. 
The multiple-image systems (Table |2j comprise two cate- 
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Table 3. Summary of Model Constraints 



Cluster 


Multiple-image 


Weak- 


-shear Measurements 




Systems" 


-f^min 

(kpc) 




(tu)'' 


A 68 
A 209 


C0/[C19], [CI], [C2], 
[C6/C20], C15/C16/C17 


200 
100 


343 
431 


0.18+0.03 
0.10+0.02 


A 267 


[E2] 


200 


323 


0.06+0.02 


t\ JO J 


RO/T? 1 /RA rR7/R'^/R 1 71 


1 fin 


357 


0.12+0.02 


A 773 




200 


297 


0.19+0.03 


A 963 


HO 


120 


455 


0.13+0.02 


A 1763 




150 


399 


0.09+0.02 


A 1835 




300 


190 


0.20+0.03 


A2218 


MO, M1,M2, [M3] 


250 


187 


0.16+0.03 


A2219 


P0/P1,P2, P3/P4/P5, 
[P6/P7/P8], [P9/P10], 
[P11/P12] 


200 


246 


0.15+0.03 



" Unconfirmed systems are listed in parenthesis. 
* (tu) is the mean tangential shear of the faint background galaxies 
selected for inclusion in the weak-shear constraints. We use the shape 
and orientation of each galaxy as an individual constraint on the lens 
model, and here summarize the strength of these constraints by listing 
(tu) for each cluster. 

gories: confirmed and unconfirmed. Confirmed multiples have a 
spectroscopic redshift and all the counter-images are either iden- 
tified or lie fainter than the detection threshold of the observations. 
The morphology of candidate multiples strongly suggests that they 
are multiply-imaged, but the redshift of these systems is less well 
defined (Az^O.l) and not all counter-images may be identified. 
The confirmed systems provide constraints on both the absolute 
mass of the clusters and the shape of the underlying mass distri- 
bution; in contrast, the unconfirmed systems place additional con- 
straints on the shape of the cluster potential. Both categories of 
multiple-image constraints probe only the central _R<50-100kpc 
of each cluster. Therefore, to extend the constraints to larger radii, 
we supplement the strong-lensing constraints with the weakly- 
sheared background galaxies. 

To use the weakly-sheared galaxies as model constraints, we 
need to estimate their redshifts. For that purpose, we use the Hub- 
ble Deep Field North (HDF-N) photometric redshift catalog of 
Fernandez-Soto et al. (1999). The A'^pixcisJ5=30 limit developed in 
il3.3.2l is equivalent to a magnitude limit of 7?702;^26. Using a 
simple no-evolution model (King & Ellis 1985) we estimate that 
a typical galaxy at 2;~0.5-1.5 has a colour of (_R702— /814)— 0.7 
in the Vega system; converting to AB magnitudes, this translates 
into a faint limit of /8i4,ab^25.8 in the HDF-N catalog. The me- 
dian redshift to this limit is Zmcdian=0.9, with an uncertainty of 
^0.2, stemming from the dispersion in galaxy colours at 2~0.5- 
1.5 and uncertainty in the conversion between the A^pixeisi^SO and 
^2702^26. We therefore adopt 2=0.9+0.2 as the fiducial redshift 
of the faint galaxy catalogs. Note that the uncertainty in the median 
redshift contributes just 10-20% of the total error budget, which is 
dominated by the statistical uncertainty in the shear measurements. 
For each cluster we also examine the region occupied by multiple- 
image systems in the HST frames and choose a minimum cluster- 
centric radius (-Rmin) for the inclusion of faint galaxies in the model 
constraints. We test the robustness of these choices by perturbing 
the -Rmin values by ±10" to ensure that the mean weak-shear com- 
puted from the galaxies lying exterior to -Rmin is insensitive to the 
perturbation in -Rmin. 

We summarize the strong- and weak-lensing model con- 
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straints in Table|3| which is the key output from 311 First it defines, 
on the basis of our analysis of the HST data and ground-based spec- 
troscopic follow-up which of the multiple-image constraints can 
be used to calibrate the absolute mass of the clusters, and which 
may be used only for constraining the shape of the cluster poten- 
tials. Second, it lists how many faint galaxies, from what observed 
regions of the clusters have been carefully selected to provide the 
weak-lensing constraints. The strength of the weak-lensing signal 
is also listed as the mean tangential shear, (tu). In the next section 
we explain how we use these constraints to model the distribution 
of mass in the cluster cores. 



4 GRAVITATIONAL LENS MODELLING 

We use the LENSTOOL ray-tracing code (Kneib 1993) supple- 
mented by additional routines to incorporate weak-lensing con- 
straints (Smith 2002) to build detailed parametrized models of the 
cluster mass distributions. We refer the interested reader to Ap- 
pendixIXIfor full details of the modelling method. Here, we explain 
the modelling process in more general terms for the non-lensing- 
expert reader whom we assume would prefer not to be distracted by 
the many technical details which may be found in the Appendix. 

Each model comprises a number of parametrized mass com- 
ponents which account for mass distributed on both cluster- and 
galaxy-scales. The cluster-scale mass components represent mass 
associated with the cluster as a whole i.e. DM and hot gas in the 
ICM. The galaxy-scale mass components account for perturbations 
to the cluster potential by the galaxies. 

A x^^stimator quantifies how well each trial lens model fits 
the data, and is minimized by varying the model parameters to ob- 
tain an acceptable (x^/dof~l) fit to the observational constraints. 
This is an iterative process, which we begin by restricting our at- 
tention to the least ambiguous model constraints (i.e. the confirmed 
multiple-image systems) and the relevant free parameters. For ex- 
ample, in a typical cluster lens there will be one spectroscopically- 
confirmed multiple-image system and a few other candidate mul- 
tiples. The model fitting process therefore begins with using the 
spectroscopic multiple to constrain the parameters of the main 
cluster-scale mass component. Once we have established an ac- 
ceptable model using the confirmed multiple-image systems, we 
use this model to explore the other constraints and to search for 
further counter-images. Specifically, we test the predictive power 
of the model and use this to iterate towards the final refined model. 
At each stage of this process we incorporate additional constraints 
(e.g. faint image pairs) and the corresponding free parameters (e.g. 
the ellipticity and orientation of key mass components, or the veloc- 
ity dispersion of cluster galaxies that lie close to faint image pairs) 
into the model. 

4.1 Construction of the Lens Models 

This section describes how the method outlined above and de- 
scribed in detail in Appendix IXI was applied to each cluster in our 
sample. The parameters and the reduced of each fiducial best-fit 
model are listed in Table|4| One of us (GPS) constructed the best- 
fit fiducial lens models for 9 out of the 10 clusters. The tenth clus- 
ter, A 2218, was modelled by JPK, based on the results described 
in Kneib et al. (1995, 1996), Ebbels et al. (1998) and EUis et al. 
(2001). The estimation of uncertainties on relevant model parame- 
ters that is required to provide robust error bars on the cluster mass 
measurements ( il6.1> was performed for all ten clusters by GPS. 



A 68 — We first constrained the model with just the multiply- 
imaged ERO at z—1.6 (CO - Table identifying nine distinct 
knots of likely star-formation in each image of this galaxy. A model 
containing just one cluster-scale mass component (#1), did not fit 
these data well: x^/dof>5. We therefore added a second cluster- 
scale mass component (#2) to the North- West of the central galaxy. 
Despite the strong evidence for the presence of this mass compo- 
nent in the weak-shear map (Fig.0, no single bright cluster galaxy 
dominates the group of galaxies found in this region. We there- 
fore adopt the brightest of this group of galaxies as the center of 
Clump #2, for which we adopt a circular shape. CO places strong 
constraints on the mass required in this second clump because the 
spatial configuration of the images is very sensitive to the details of 
the bi-modal mass structure of the cluster. We find an acceptable fit 
without optimizing the spatial parameters of Clump #2. The South- 
west corner of CO straddles the z—1.6 radial caustic in this best-fit 
lens model, causing an additional, radially amplified image of this 
portion of the galaxy to be predicted. We search the HST frame 
in the vicinity of the predicted radial image, and find a faint radial 
feature (CI 9) 4" North- West of the central galaxy which is consis- 
tent with the model prediction. Further constraining the model with 
C15/C16/C17, at 2=5.4 confirms the validity of the model thus far, 
and helps to constrain the spatial parameters of the NW cluster- 
scale mass component. This model is also able to reproduce the 
observed morphology of the other candidate multiple-image sys- 
tems. 

A 209 — Given the weak constraints on this cluster from the HST 
data, we restrict our attention to a simple model in which the ve- 
locity dispersion of the central cluster-scale mass component is the 
only free parameter. 

A 267 — The important difference between this cluster and A 209 
is that it contains a candidate multiple-image pair (E2a/b). In addi- 
tion to constraining CTo and rcoro for the central cluster-scale mass 
component we therefore use this image pair to constrain the shape 
of the cluster potential. 

A 383 — We use the many constraints available for this cluster to 
determine precisely the full range of geometrical and dynamical pa- 
rameters for the cluster-scale and central galaxy mass components. 
Despite the overall relaxed appearance of this cluster, the bright 
cluster elliptical South-West of the central galaxy actually renders 
this a bi-modal cluster (albeit with very unequal masses) on small 
scales. We therefore also obtain a constraint on the velocity dis- 
persion of this galaxy (A 383 #2 in Table |4j. Sand et al.'s (2004) 
spectroscopic redshifts for Bla/b and BOb, placing them both at 
z=1.01, i.e. the same redshift as BOa, slightly modifies Smith et 
al.'s (2001) multiple-image interpretation of this cluster. However 
the parameter space occupied by this cluster is consistent with that 
of Smith et al.'s model. 

A 773 — Although no multiple-image systems have been identi- 
fied yet in this cluster, the large number of early-type galaxies and 
the strength of the weak-shear signal suggest that this cluster is 
probably quite massive. First we use the shapes of the weakly- 
sheared galaxies (Table|3j to constrain a model that contains a sin- 
gle cluster-scale mass component centered on the BCG (A 773 #1 
in Table |4j. The best-fit velocity dispersion of Clump #1 in this 
model is ~1000 kms~^. However, the spatial structure in the 
residuals reveals that this simple model does not reproduce the 
strong shear signal observed to the North of the second brightest 
cluster galaxy and to the East of the BCG, i.e. in the saddle region 
between the BCG and the group of cluster ellipticals at the East- 
em extreme of the WFPC2 field-of-view (Fig. 0. We therefore 
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Table 4. Best-fit Parameters of the Fiducial Lens Models" 



Cluster 


Mass*" 


AR.A.'^ 


ADec.'^ 


a/b 


e 


^corc 










Component 


(") 


(") 




(deg) 


(kpc) 


(kpc) 


(km s^^) 




Individually Optimized Mass Components 


A 68 


Cluster #1 


+0.6 


-0.7 


2.2 


37 


108 


[1000] 


950 


11.6/11 




Cluster ffz 


[-45. 8J 


[+00. 4] 


l.U 


Do 


Q1 

oi 


[iUUUJ 


/ U / 






BCG 


—0.2 


0.0 


1.7 


37 


0.3 


83 


301 




A 209 


Cluster #1 


[0 01 


fO 01 

[U.UJ 


fl 91 


[431 


[501 


[1000] 


630 


0.6/1 


A 267 


Cluster #1 


[0 01 

[U.UJ 


fO 01 

[U.UJ 


2.0 


—60 


115 


[1000] 


1060 


3.6/3 


A 383 


L.luster if I 


+0.3 


+0.5 


1.13 


109 


51 


[1000] 


900 


12.8/16 




ualaxy #z 


[+14.9] 


[-16.8] 


[1.13] 




2.2 


[40] 


176 








— U.O 


1 n 1 


l.U ( 


1 Oft 


U.O 


iiU 


OiU 




A 773 


Cluster #1 


[0.0] 


[0.0] 


[1.9] 


[-38] 


[75] 


[1000] 


750 


3.6/3 




Cluster #2 


[+1.0] 


[+24.0] 


[1.8] 


[-10] 


[75] 


[1000] 


700 






Cluster ffi 


[+84.4] 


[+12.0] 


[1.0] 




[75] 


[1000] 


550 




A 963 


Cluster #1 


[0.0] 


[0.0] 


1.7 


90 


95 


[1000] 


980 


1.4/2 




BCG 


[0.0] 


[0.0] 


1.1 


[90] 


<2 


96 


320 




A 1763 


Cluster #1 


[0.0] 


[0.0] 


[1.9] 


[180] 


[70] 


[1000] 


700 


5.1/3 


A 1835 


Cluster #1 


[0.0] 


[0.0] 


[1.5] 


[70] 


[70] 


[1000] 


1210 


0.7/1 


A 221 8 


Cluster #1 


+0.2 


+0.5 


1.2 


32 


83 


[1000] 


1070 


17.8/19 




Cluster #2 


[+47.0] 


[-49.4] 


1.4 


53 


57 


[500] 


580 






Galaxy #3 


[+16.1] 


[-10.4] 


[1.1] 


[70] 


<2 


65 


195 






Galaxy #4 


[+4.8] 


[-20.9] 


[1.4] 


[-23] 


<2 


77 


145 






BCG 


+0.3 


+0.1 


1.8 


53 


<3 


136 


270 




A2219 


Cluster #1 


+0.1 


+0.2 


1.7 


35 


77 


[1000] 


902 


3.7/3 




Cluster #2 


[+39.2] 


[-32.0] 


[1.1] 


[8] 


55 


375 


515 






Cluster #3 


[-22.9] 


[+4.5] 


[1.0] 




31 


365 


395 






BCG 


[0.0] 


[0.0] 


[1.6] 


[29] 


<3 


120 


278 





Luminosity Scaled Mass Components 


L*^ galaxy^ 


0.2 


30 


180 



" Parameter values listed in parenthesis were not free parameters. 

^ Individually optimized mass components are numbered and identified as being cluster- or galaxy-scale. 

The position of each mass component is given relative to the optical centroid of the central galaxy in each cluster (TablelTl. 

Cluster galaxies are included in the lens models down to the limit where the mass of additional components would be comparable with the 
uncertainties in the overall cluster mass, which equates to a magnitude limit of Ki^K*+2.5. 



introduce two more cluster-scale mass components: A 773 #2 is 
coincident with the second brightest cluster elliptical and A 773 #3 
coincides with the brightest member of the Eastern group of galax- 
ies. This model faithfully reproduces the global shear strength, and 
crucially it also reproduces the spatial variation of the shear and 
thus provides a superior description of the cluster potential than the 
initial simple model. 

A 963 — HO provides a straightforward yet powerful constraint on 
the potential of this relaxed cluster, enabling the dynamical and 
spatial parameters of both the cluster-scale and BCG mass compo- 
nents to be constrained. 

A 1763 — This cluster is similar to A 209 in that there are no con- 
firmed multiple-image systems and the weak-shear signal is rel- 
atively low (Table |3}. We therefore fit a model that contains the 
velocity dispersion of the (single) cluster-scale mass component as 
the only free parameter. Overall, this simple model is an acceptable 
fit to the global weak-shear signal, however it fails to reproduce 
the large observed shear signal to the West of the central galaxy 
(Fig.0. We interpret these residuals as a signature of substructure 
in this cluster, indicating that the mass distribution may be more 
complex than a single cluster-scale mass plus galaxies. Unfortu- 



nately the weak-shear signal is not strong enough to place any fur- 
ther constraints on this cluster at this time. 

A 1835 — The multiple-image interpretation of Schmidt et al. 
(2001) is ruled out by the new WFPC2 data presented in this pa- 
per, specifically, the differences in surface brightness between KO, 
Kl and K3. The absence of multiple-image constraints therefore 
results in a model similar to those of A 209 and A 1763, with just a 
single free parameter - the central velocity dispersion of the central 
cluster-scale mass component. 

A 2218 — The model of A 22 18 builds on the models published 
by Kneib et al. (1995; 1996) and incorporates for the first time the 
spectroscopic redshifts of the M2 (Ebbels et al. 1998) and M3 (El- 
lis et al. 2001) multiple-image systems. In addition to the central 
cluster-scale mass component (A 2218 #1), this model contains a 
second cluster-scale mass component (A 22 18 #2) centered on the 
second brightest cluster galaxy which lies South-East of the BCG. 
The velocity dispersion and cut-off radius of the two bright cluster 
galaxies (A 2218 #3 & #4) that lie adjacent to the MO multiple- 
image system are also included as free parameters. 

A 2219 — We first attempt to find an acceptable solution that is 
based on a single cluster-scale mass component centered on the 
BCG, constrained just by PO. This model succeeds in reproducing 
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the straight morphology of PO (Fig.|2j, however when P3/P4/P5 are 
added to the constraints, the fit deteriorates substantially. We there- 
fore add a second cluster-scale mass component (A 2219 #2) at the 
position of the second brightest cluster galaxy (South-East of the 
central galaxy). The second clump improves the fit somewhat, but 
the tight constraints from these two multiple-image systems on the 
saddle region between clumps #1 and #2 necessitate the addition of 
a third cluster-scale component (A 2219 #3 - see Fig.0. This tri- 
modal model is a good fit, and readily accommodates the additional 
constraints from P2a/b/c with a minimum of further modifications. 
This best-fit model is also able to reproduce faithfully the details 
of the candidate multiple-image systems. 



4.2 Calibration of Weak Lensing Constraints 

We investigate the systematic uncertainty that may arise as a result 
of confirmed multiple-image systems not being available for all 
of the clusters. First, we focus on the five clusters for which both 
multiple-image and weak-shear constraints are available (A 68, 
A 383, A 963, A 2218, A 2219). We ignore the multiple-image con- 
straints and construct a model of each of these clusters using just 
the weak-shear information. In common with the five lens mod- 
els that are based solely on weak-shear constraints (A 209, A 267, 
A773, A 1763, A 1835) we find that the weak-shear signal alone 
can generally only constrain one free parameter (cto) per cluster- 
scale mass component. Individually, the velocity dispersions of 
the cluster-scale mass components in the weak-shear constrained 
models agree within the uncertainties with the velocity disper- 
sions obtained in the multiple-image constrained models. How- 
ever, when treated as an ensemble, the mean ratio of weak-shear 
constrained velocity dispersions to multiple-image constrained ve- 
locity dispersion is 0.94±0.04. Based on just five clusters, it there- 
fore appears that Go for the cluster-scale mass components in the 
models of weak-shear only clusters may be under-estimated, on 
average, by ~6%. Mass scales as cto^; this possible systematic er- 
ror in (Jo therefore translates into a possible ~12% under-estimate 
in cluster mass. 

This uncertainty probably arises from contamination of the 
faint background galaxy catalogues by faint cluster galaxies, which 
we estimated conservatively to be ~20% in i|3.3l Our cross- 
calibration of the strong- and weak-lensing constraints therefore 
suggests that the contamination is somewhat lower than previously 
thought. We choose not to correct the parameters of the weak-shear 
constrained models for this effect because the uncertainties in these 
models are dominated by the statistical uncertainty which is typi- 
cally Ao-o~10-20%. A global 6% correction to the velocity disper- 
sions of the weak-lensing constrained cluster lens models would 
also neglect the dependence of the contamination, for a given clus- 
ter, on the optical richness of that cluster. This uncertainty has a 
negligible affect on the results that rely on absolute cluster mass 
measurements ( il6.2t 



5 X-RAY DATA AND ANALYSIS 

We complement the detailed view of the distribution of total mass 
in the cluster cores that is now available to us from the lens models 
with high-resolution X-ray observations with Chandra. The pur- 
pose of including these data is to compare the underlying mass 
distribution derived from lensing with the properties of the ICM. 
Specifically, we wish to compare the mass and X-ray morphologies 



Table 5. Summary of Archival X-ray Observations 



Cluster 


Obs. 


Tcxp 


kTx, tot" 


^'^X ,ann 




ID No. 


(ks) 


(keV) 


(keV) 




jZJU 




9.5_Q 7 


9-3-1.0 


A 209 


522 


10.0 


a A+0-5 
8-4-0.5 


Q y+0.6 


A 267 


1448 


6.4 


K n+0.5 
5-9-0.4 


R n+0.7 


A 383 


524 

2320 

2321 


7.4 
17.9 
14.3 


4.3«:? 


^ 9+0.2 


A 773 


533 


11.3 


8.0«;i 


o 9+0.5 


A 963 


903 


3.6 


y O+0.3 

'-■^-0.3 


7 9+0.3 
'-^-0.3 


A1763'= 


801049 


18.0 


8 Q+0-5 
o-»_0.4 




A 1835 


496 


10.5 


7 7+0.3 
'-'-0.2 


9.3«-j 


A2218 


1454 
553 


9.7 
5.4 






A2219 


896 


42.3 


14 0+"-* 


i3.8t«:? 



" kTx tot is measured in an aperture of radius il^2Mpc 

* A;Tx,anii is measured in an annulus defined by 0.1S5-RsJ2Mpc 

A 1763 has not been observed by Chandra. The ID number and expo- 
sure time listed for this cluster relate to the archival ROSAT data avail- 
able for this cluster, and the temperature is from Mushotzky & Schaif 's 
(1997) analysis of ASCA data. 

of the clusters, and to explore how the lensing-based mass mea- 
surements are correlated with the X-ray temperature of the clusters 

We therefore exploit archival Chandra data for nine of the 
clusters (Table In the spectral and imaging analysis we used 
only chips I0-I1-I2-I3 and chip S3 for observations in ACIS-I and 
ACIS-S configurations respectively. All of the Chandra observa- 
tions were performed in ACIS-I configuration except A 383 (ID: 
2321), A 963 and A 1835 which were observed in ACIS-S con- 
figuration. To reduce the data we used the procedures described 
by Markevitch et al. (2000), Vikhlinin et al. (2001a), Markevitch 
& Vikhlinin (2001), and Mazzotta et al. (2001). We note that the 
three observations of A 383 were not all performed in the same 
configuration. The spectral response and background for each ob- 
servation was therefore generated individually before combining 
the data. The data were also cleaned for the presence of strong 
background flares following the prescription of Markevitch et al. 
(2000a). The net exposure time for each observation is listed in Ta- 
ble |5] Adaptively smoothed flux contours are also over-plotted on 
the HST frames in Fig.Q 

Spectral analysis was performed in the 0.8-9 keV energy band 
in PI channels, thus avoiding problems connected with the poor 
calibration of the detector at energies below O.SkeV. Spectra were 
extracted using circular regions centered on the X-ray centroid of 
each cluster within a radius of 2Mpc at the cluster redshift, be- 
ing careful to mask out all the strong point sources. An absorbed 
MEKAL model was used, with the equivalent hydrogen column den- 
sity fixed to the relative Galactic value (Dickey & Lockman 1990). 
The temperature, plasma metallicity, and normalization were left 
as free parameters. Because of the hard energy band used in this 
analysis, the derived plasma temperatures are not very sensitive to 
the precise value of A'^h. We list the temperature of each cluster de- 
rived from the total field of view (i.e. kTx, tot =kTx{R4:'2Mpc)) 
in Tabled 

The presence of a "cool core" (e.g. Allen, Schmidt & Fabian 
2001) could bias low the cluster temperature measurements. As 
the aim is to obtain a reliable global measurement of the cluster 
temperatures, we therefore re-measured the temperatures in an an- 
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nulus fcTx,ann=fcTx(0.1^_R^2Mpc) centered on the X-ray cen- 
troid of each cluster (Markevitch 1998). There is a significant dif- 
ference between kTx, tot and kTx.mun in just two clusters: A 383 
and A 1835 (Table|5j. Both of these clusters have previously been 
identified as containing an emission line BCG (Smith et al. 2001; 
Allen et al. 1996), which is arguably the most reliable indicator of 
central cold material in clusters (Edge et al. 1990). We also note 
that none of the seven clusters for which, within the uncertainties, 
kTx ,tot~kTx ,a.nn have previously been identified as containing a 
cool core (e.g. White et al. 1997). We list the temperature ratios, 
kTx , tot / kTx ,iLnn, in Table|6l 



6 RESULTS 

We begin with a brief review of where the preceding three sections 
of analysis and modelling have brought us toward our goals of char- 
acterizing the dynamical maturity of X-ray luminous clusters at 
z~0.2 and calibrating the high-mass end of the mass-temperature 
relation. 

The detailed gravitational lens models ( i|4. It reveal the total 
matter content of the clusters; in il6.1l we compute and analyze de- 
tailed mass maps using the best-fit models. These measurements of 
total cluster mass are complemented by the X-ray pass-band (Js) 
which reveals the details of the hot intracluster medium. In il6.1l 
we compare the spatial distribution of total mass with the spatial 
structures in the X-ray flux maps and temperature measurements 
derived from the Chandra observations. We also compare the total 
matter and ICM with the spatial distribution of stars in the clus- 
ters using the measurements of the i^-band luminosity of cluster 
galaxies estimated in i|3.3.1l In summary, the synthesis presented in 
i|6.1l aims to diagnose whether or not the clusters are dynamically 
mature using independent probes of dark matter (inferred from the 
lensing mass maps), hot intra-cluster gas and cluster galaxies. 

In il6.2l we adopt a different approach - we explore correla- 
tions between the integrated properties of the clusters. Specifically, 
we use the cluster mass. X-ray luminosity and X-ray temperature 
measurements to normalize the cluster scaling relations and to in- 
vestigate the scatter about these normalizations. A key focus of this 
exercise is to exploit the structural results from il6.1l to search for 
structural segregation in the scaling relations. This is an important 
step toward constraining the mass assembly and thermodynamic 
histories of clusters as a function of cosmic epoch, and is also rele- 
vant to pinpointing potential astrophysical systematic uncertainties 
when cluster are used in the measurement of cosmological param- 
eters. 

6.1 Mass and Structure of Cluster Cores 

We begin by using the gravitational lens models to measure the 
mass of each cluster, and to quantify the spatial distribution of that 
mass. All of the diagnostics discussed in this section are listed in 
Table|6| together with the overall diagnosis of "relaxed" or "unre- 
laxed" - we define these terms in this section. 

6.1.1 Total Cluster Mass and its Spatial Distribution 

We adopt a fixed projected aperture of i?=500kpc which is well- 
matched to the scales probed by the HST data, and measure the 
mass interior to that radius: Mtot=Af(i?^500kpc). We also want 
to characterize the spatial distribution of mass in the cluster core. 
The number of cluster-scale mass components (A^dm) in the lens 




'-K.BCc/'-K.tot 

Figure 6. Central mass fraction, Mean / Mtot versus central X-band lu- 
minosity fraction, Lx.BCG / ^K.tot (3^3- This plot reveals a remarkably 
clean separation between clusters with a mass distribution that is heavily 
dominated by the central mass components, and a stellar luminosity distri- 
bution that is dominated by the BCG. The horizontal and vertical dashed 
lines mark the divisions between high and low central mass and A'-band 
luminosity fractions respectively. See >l6.1.1l for further discussion of this 
separation. 

models sheds some light on this question (Table|4]&|6|. However, 
A^DM does not contain any explicit information about mass. We 
therefore complement this quantity by measuring A/ccn, defined as 
the projected mass within ii=500kpc that is associated with the 
centrally-located mass components, i.e. the dominant cluster-scale 
mass component and the BCG. We list the central mass fraction, 
A/ccn /A/tot, in Table |6| The uncertainties in these measurements 
are estimated by exploring the parameter space occupied by each 
lens model, identifying the family of models that satisfy A^^^l- 

The central mass fractions comprises two contributions: (i) 
cluster-scale mass components in the lens models that are associ- 
ated with massive in-falling structures, and (ii) cluster galaxies that 
are associated both with the central cluster-scale DM halo (and are 
presumably virialized) and with the in-falling structures. The cen- 
tral mass fraction therefore characterize the dominance of the cen- 
tral concentration of mass in the overall cluster mass distribution. 
The measurements listed in Table|6|(see also Fig.|6j reveal that the 
clusters fall into two categories: A 267, A 383, A 963 and A 1835 
form a homogeneous sub-sample, all with Mcon/Aftot>0.95, i.e. 
mass distributions heavily dominated by the central components; 
the remaining six all have Afccn/Mtot<0.95 and are much more 
diverse than the former sub-sample, with central mass fractions 
spanning 0.4<Mccn/Mtot <0.9. 

As an independent cross-check on this sub-classification of 
the clusters, we measure the distribution of stars in the clusters us- 
ing the K-hand luminosities of cluster galaxies described in il3.3.1l 
We divide the A'-band luminosity of each BCG (i.e. the luminos- 
ity that is spatially coincident with the central mass components) 
by the combined K-hand luminosity of all the cluster galaxies de- 
tected in each HST frame. These central A'-band luminosity frac- 
tions (I/A',BCG / Lkaoi) are listed in Table|6|and plotted versus the 
central mass fractions in Fig. |6| The central luminosity fractions 
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Table 6. Mass and Substructure Diagnostics 



Cluster N-dm" A/tot Mccn/Mtot'' iA'.BCG/iif.tot X-ray Arpcak'' kTx , tot / kTx ,^nn Overall 

(lO^* M0) Morphology (kpc) Classification 



Ado 


2 


4.4ib0.1 


U.OO±U.Ui 


U.O0±U.U/ 


Irregular 


50itl5 


1 nn+O-18 
1-00-0.13 


Unrelaxed 


A 209 


1 


1.6±0.5 


0.87±0.06 


0.38±0.06 


Irregular 


17±4 




Unrelaxed 


A 267 


1 


2.6±0.4 


0.96±0.01 


0.76±0.02 


Elliptical 


88±5 


0.98«:lt 


Unrelaxed 


A 383 


1 


3.6±0.1 


0.97±0.01 


0.66±0.03 


Circular 


<4 


n 09+0. 06 


Relaxed 


A 773 


3 


5.1±1.2 


0.41±0.20 


0.22±0.08 


Irregular 


42±8 


0-y«'-0.08 


Unrelaxed 


A 963 


1 


3.3±0.2 


0.97±0.01 


0.56±0.04 


Elliptical 


<4 




Relaxed 


A 1763 


1 


2.1±0.8 


0.90±0.05 


0.45±0.05 


Irregular 


80±20 


Unrelaxed 


A 1835 


1 


5.8±1.1 


0.97±0.01 


0.50±0.05 


Circular 


<5 


n oq+0.06 
U-83_o 04 


Relaxed 


A2218 


2 


5.6±0.1 


0.77±0.01 


0.19±0.08 


Irregular 


38±7 




Unrelaxed 


A2219 


3 


3.4±0.1 


0.85±0.01 


0.32±0.07 


Irregular 


13±4 


1 01+°-°* 


Unrelaxed 



" ^DM is the number of cluster-scale DM haloes contained in each best-fit lens model. 
' Mcen is the mass that resides in the centrally-located DM halo of the lens model and the BCG ( 86.11 . 
The uncertainties on Arpcak include uncertainties on the central coordinates of the cluster mass distribution in the relevant lens models. 



span Lx.BCc/iK, tot ~0. 2-0.8, and are not obviously more ho- 
mogeneous for low and high central fraction clusters. Nevertheless, 
there appears to be a roughly monotonic relationship between the 
central mass fraction and central JC-band luminosity fraction, thus 
to first order confirming the separation of the cluster sample into 
two structural classes. 

This sub-classification into a homogeneous sub-sample of 
clusters with Afccn/Mtot>0.95 and a diverse sub-sample with 

0. 4<AfcGn/A^tot <0.9 matches the details of the cluster lens mod- 
els reasonably well. The lens model of each of the former clusters 
contains a single cluster-scale mass component. The situation is 
less clear-cut for the latter sub-sample. Lens models of four of the 
six clusters contain two or more cluster-scale mass components, 

1. e. their mass distributions are unambiguously bi- or tri-modal 
(see also Fig.0, and thus the low central mass fractions are dom- 
inated by substructure in the cluster cores. However the remaining 
two (A 209 and A 1763) contain a single cluster-scale mass compo- 
nent. It is therefore ambiguous whether the moderately low central 
mass fractions in these clusters genuinely reflect cluster substruc- 
ture, or are simply due to the cluster galaxy populations. One pos- 
sibility is that these two clusters are both undergoing mergers in the 
plane of the sky. This would help to explain the absence of a con- 
firmed strong lensing signal (Table|2j, the low aperture mass mea- 
surements (Table |6} and the moderately low central mass and K- 
band luminosity fractions. Wider-field HST imaging would help to 
resolve this uncertainty. 



6.1.2 Total Mass Versus X-ray Flux and Temperature 

We now compare the cluster mass distributions with the X-ray ob- 
servations to gain further leverage in diagnosing the maturity of the 
full sample of 10 clusters. 

Iso-mass contours computed from the best-fit lens models 
and adaptively smoothed X-ray flux contours from the Chandra 
observations are overplotted on the HST frames in Fig.0 We also 
carefully check the pointing accuracy of the Chandra observations 
using 28' x42' panoramic CFH12k imaging of these cluster fields 
(Czoske 2002) to confirm that the Chandra astrometry matches the 
frame defined by the optical data to an rms accuracy of ^2kpc at 
the cluster redshift. We measured the offset between the position 
of the X-ray peak in each Chandra frame and list Arpcak, the off- 
set between this position and the center of mass in the mass-maps 



in Table H We compare the mass and X-ray morphologies with 
the mass and luminosity fractions discussed in i|6.1.1l Three of the 
four high central mass fraction clusters (A 383, A 963, A 1835) ap- 
pear relaxed at X-ray wavelengths, i.e. circular or mildly elliptical 
morphology. A 267, is an exception to this picture - its X-ray flux 
contours are much less regular than the other three systems, and 
there appears to be a large offset between X-ray and mass centers. 
The six low central mass fraction clusters also have irregular X-ray 
morphologies and misalignment between X-ray and mass peaks. 

We also list in Table|6|the ratio of the total to annular tempera- 
tures (kTx , tot / kTx ,a.nn) of each cluster measured in Jsjto test for 
the presence of cool cores. Eight of the clusters, comprising the six 
with low central mass fractions plus A 267 and A 963 display no 
evidence of a cool core. The absence of evidence for a cool core in 
A 267 is unsurprising given the likely dynamical disturbance in the 
cluster core as indicated by its X-ray morphology. A temperature 
ratio of unity for A 963 is also consistent with previous work on 
this cluster, which has traditionally been classified as an "interme- 
diate" cluster (e.g. Allen 1998), i.e. it appears to be quite relaxed, 
but has not acquired a cool core since (presumed) previous merger 
activity. This is also consistent with the mild ellipticity in the X-ray 
isophotes, in contrast to the almost circular isophotes of A 383 and 
A 1835. 



6.1.3 Summary 

Table |6| lists all of the diagnostics described in this section. Each 
diagnostic in isolation offers a slightly different view of the dy- 
namical maturity of each cluster. We combine all of the available 
information to determine a robust diagnosis of each cluster's ma- 
turity, and to identify the remaining uncertainties. In making the 
overall classifications listed in Table |6| we define the term "re- 
laxed" to mean that the cluster is dynamically mature in all diag- 
nostics available to us, with the exception that we do not require it 
to have a cool core. In terms of the diagnostics listed in Table|6| this 
means that A'^dm^I, Mcon/Mtot^O.95, Lk.bcg / LK,tot>0.5, 
Arpcak <4kpc, and the X-ray morphology is either circular, or 
mildly elliptical. The unrelaxed clusters do not satisfy one or more 
of these criteria. 

We therefore conclude that 7 out of the 10 clusters in our 
study, i.e. 70±20% of X-ray luminous cluster cores at 2:~0.2 are 
dynamically immature (the error bar assumes binomial statistics - 
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Figure 7. For each cluster in the sample, we show (left) the adaptively smoothed X-ray flux contours and (right) the iso-mass contours calculated from the 
best-fit lens models. The greyscales are low-resolution renderings of the HSTPNWC2 frame. All three of the clusters classified as relaxed (A 383, A 963 
and A 1835; Table|5) have very regular and centrally concentrated X-ray and mass morphologies. The seven unrelaxed clusters all have irregular X-ray 
morphologies, and four of them (A 68, A 773, A 221 8, A 2219) have bi/tri-modal mass morphologies. All contours are equally spaced in the log. 
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Gehrels 1986). Henceforth we classify A 383, A 963 and A 1835 as 
"relaxed" clusters and A68, A209, A267, A773, A 1763, A2218 
and A 22 19 as "unrelaxed" clusters (Table |6l. 

6.2 Cluster Scaling Relations 

We now investigate the scaling relations between cluster mass, tem- 
perature and X-ray luminosity, focusing on the normalization of 
and scatter around the relations and the impact of the dynamical 
immaturity of 70% of the sample identified in i|6.1l 

6.2. 1 Mass Versus X-ray Luminosity 

The sample is selected on X-ray luminosity (ij2j, we therefore be- 
gin with the mass-luminosity relation. First, we explore whether 
we can improve on the precision of the RASS-based X-ray lumi- 
nosities (Table0 using the Chandra data. One of the largest uncer- 
tainties in the luminosities quoted in TableQis that ROSAT's large 
PSF limited the efficiency with which point-sources could be ex- 
cised from the cluster data. The sub-arcsecond PSF of the Chandra 
data overcome this problem, however we find that the corrections 
for point-sources are modest and comparable with the extrapola- 
tion uncertainties that arise from Chandra's field of view which is 
too small to embrace all of the extended emission from clusters 
at z=0.2. The Chandra-hased luminosities are therefore no more 
precise than the ROSAT luminosities at this redshift. We therefore 
adopt the X-ray luminosities upon which the sample was selected 
(Table[T}. 

We plot Mtot versus X-ray luminosity in Fig. [8| De- 
spite selecting very X-ray luminous clusters for this study 
(Lx^SxlO""' ergs~^, 0. 1-2.4 keV), these data span sufficient dy- 
namic range in principal to constrain both the slope and normaliza- 
tion of the mass-luminosity relation (c.f. Finoguenov et al. 2001). 
We parametrize the mass-luminosity relation as follows: 

(Lx/lO**ergs-^)=A(Mtot/lO"M0)^/° (1) 

and try to solve for A and a following Akritas & Bershady (1996) 
to account for errors in both variables and unknown intrinsic scatter. 
Unsurprisingly, given the large scatter that is immediately obvious 
upon visual inspection of Fig. [S] this exercise fails. We therefore 
fix the slope parameter and simply measure the normalization, A. 
This is done by computing the mean mass and luminosity, and then 
solving log A=(log(Lx)) — {log(Mtot))/Q!. Uncertainties in both 
mass and luminosity are included in the calculation by repeating it 
10* times, on each occasion drawing values of Mtot and Lx ran- 
domly from the distributions defined by the error bars on X-ray 
luminosity and mass listed in TablesQand|6|respectively. Simple 
gravitational collapse models predict that q:=0.75 (Kaiser 1986), 
we therefore initially measure the normalization using this value 
for the slope, and also compute the intrinsic scatter around this 
model. These calculations are performed for the whole sample of 
ten clusters and the relaxed and unrelaxed sub-samples, and the 
results listed in Table |7| Based on these calculations, there is no 
evidence for segregation between relaxed and unrelaxed clusters in 
the mass-luminosity plane, and the scatter is ctm— 0.4. 

We repeat these calculations using an empirical determina- 
tion of the slope: a^0.76to,ll (Allen et al. 2003), drawing ran- 
domly from the error distribution on a in the same manner as de- 
scribed above for the mass and luminosity data. This has the effect 
of broadening the uncertainties on the normalizations listed in Ta- 
ble0 but does not change the overall conclusion. 



Table 7. Scaling Relations - Normalizations and Scatters 



Sample Slope" Normalization Scatter 



Mass-luminosity: L=AM'^^°' 

All a=0.75 A=2.64±0.36 (ta/=0.41 

Relaxed A=2.63±0.36 0-^=0.30 

Unrelaxed A=2.65±0.50 (tm=0.45 

All a=0.76lo:t3 A=2.53±1.74 0-^=0.41 

Relaxed A=2.50±2.05 (tm=0.30 

Unrelaxed A=2.54±1.68 (Tm=0.45 



Mass-temperature: kT=B M'^/P 

All /3=1.5 B=3.59±0.24 ctt=0.A2 

Relaxed B=2.76±0.20 crT=0.29 

Unrelaxed B=4.01±0.36 (Tt=0.43 

All /3=1.581°:ot -B=3.72±0.30 o-t=0.41 

Relaxed _B=2.88±0.26 (tt=0.29 

Unrelaxed _B=4.15±0.41 0-^=0.42 



Luminosity-temperature: kT=C L^/^ 

All 7=2 C=2.21±0.09 o-t=0.33 

Relaxed C=1.70±0.07 (tt=0.24 

Unrelaxed C=2.46±0.14 crT=0.30 

All 7=2.09j:o;2g C=2.22±0.71 crT=0.33 

Relaxed ' C=1.71±0.60 (Tt=0.24 

Unrelaxed C=2.48±0.76 cr-r=0.30 



" For each scaling relation, the first slope parameter listed is based on 
the self-similar collapse (e.g. Kaiser 1986). The second value in each 
case is taken from recent empirical measurements: Allen et al. 2003; 
Finoguenov et al. 2001; Markevitch 1998. 

6.2.2 Mass Versus Temperature 

We plot Aftot versus kTx, a.nn in Fig.|8| and parametrize the relation 
as: 

(fcTx,ann/l keV) =B (A/tot / lo" M© ) (2) 

Note that we consider kTx,a.nn, and not kTx,tot', the results de- 
scribed here are therefore robust to the presence of cool cores in 
relaxed clusters. 

We adopt the theoretically predicted slope: p—1.5, which is 
consistent with observations for the most massive clusters (e.g. 
Finoguenov et al. 2001; Allen et al. 2001). Following the proce- 
dures described above, we measure the normalization and scatter, 
and list the results in TableQ Two significant results emerge from 
this exercise. First, the unrelaxed clusters are on average 40% hotter 
than the relaxed clusters at 3-a significance, and second the scat- 
ter about the mass-temperature relation for all clusters is ctt— 0.4. 
The statistical significance of the temperature off-set is reduced to 
2.5-a if an empirically measured value of /3 is used in place of the 
theoretical value (e.g. /3=1.581q:o? " Finoguenov et al. 2001). 

6.2.3 X-ray Luminosity Versus Temperature 

Finally, we parametrize the luminosity-temperature relation as: 

(fcTx,ann/lkeV)=C(Lx/10**ergs-^)i/^ (3) 

and repeat the analysis described above. Adopting the theoretical 
value of 7=2, the measured values of C (Table indicate that 
unrelaxed clusters are 30% hotter than relaxed clusters, at 2.4- 
a significance, i.e. less significance than in the mass-temperature 
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L, (erg/s) [0.1-2.4keV] kT,c„„(keV) kT,„„^(keV) 

Figure 8. Mass-luminosity (left), mass-temperature (center) and luminosity-temperature (right) relations for our sample of ten clusters. The relaxed/unrelaxed 
clusters are shown by the circular/square symbols as explained in the legend. The solid and dashed lines show the best-fit relations normalized by the relaxed 
and unrelaxed clusters respectively; see i|6.2l and TablelTlfor further information. The error bars on each line show the uncertainty on the normalization of each 
relation. In summary, these relations show that the large scatter detected in the mass-luminosity plane appears to be symmetric; in both the mass-temperature 
and luminosity-temperature planes, the scatter appears to be asymmetric, with the uni'elaxed clusters being on average hotter than the relaxed clusters. This 
segregation is statistically insignificant in the luminosity-temperature plane, and significant at the 2-3(t level in the mass-temperature plane. 



plane. However, adopting an empirical measurement of 7 in the fit 
(7=2.09±0.29 - Markevitch 1998) eliminates the statistical signif- 
icance in this difference. Nevertheless, this hint of structural seg- 
regation in the luminosity-temperature plane (see also Fig. |6} is 
significant for two reasons. First, it provides a lensing-independent 
cross-check on the results in the mass-temperature plane, in that 
both luminosity and temperature measurements are independent of 
the lens modelling upon which the cluster mass measurements are 
based. Second, it is consistent with previous detections of structural 
segregation in the luminosity-temperature plane (e.g. Fabian et al. 
1994). 



7 DISCUSSION 

This is the first study to combine a detailed, high-resolution strong- 
lensing analysis of an objectively selected cluster sample with anal- 
ysis of a high-resolution X-ray spectro-imaging dataset. As such, 
it affords the first opportunity to combine high-quality optical and 
X-ray probes of cluster mass, structure and thermodynamics. The 
results presented in the previous section may be summarized as fol- 
lows: 

(i) 70±20% of X-ray luminous cluster cores at z=0.2 are dy- 
namically immature; 

(ii) scaling relations between cluster mass, luminosity and tem- 
perature display scatter of (7~0. 3-0.6; 

(iii) the normalization of the mass-temperature relation for unre- 
laxed (dynamically immature) clusters is 40% hotter than for 
relaxed clusters. 

We discuss these results in ilV.ll and ^^J.2\ and close by considering 
the implications of the results for the use of massive clusters as 
cosmological probes ( i|7.3> . 

7.1 Dynamical Immaturity of Cluster Cores 

Galaxy clusters grow by accreting DM, gas and galaxies from 
their surroundings, including the filamentary structure (iQ. The 



observed structure of clusters therefore probes a combination of 
both the in-fall history and the relaxation processes that govern the 
timescales on which clusters regain equilibrium following cluster- 
cluster mergers. 

The clusters can be grouped into three categories on the basis 
of the mass and X-ray flux maps in Fig.Q The least ambiguous cat- 
egory is the relaxed clusters (A 383, A963 and A 1835 - see ^6. It . 
all of which display a similar degree of relaxation in both mass 
and X-ray. The other two categories are sub-divisions of the unre- 
laxed clusters. First, we consider the four unrelaxed clusters with 
obviously bi/tri-modal mass distributions (A 68, A 773, A 2218 and 
A 22 19). Whilst the X-ray flux contours of these clusters are both 
irregular and elongated in the same directions as the mass distribu- 
tions, there is no obvious evidence of bi-modality in the X-ray flux. 
On radial scales greater than a few tens of kpc from the center of the 
cluster BCGs, the mass maps should trace the distribution of DM. 
The ICM in these clusters therefore appears to be more relaxed than 
the DM distribution. DM is generally believed to be collisionless 
(Davis et al. 1985; however see also Spergel & Steinhardt 2000); in 
contrast, the ICM is baryonic and therefore collisional. The absence 
of X-ray bi-modality in clusters with mass bi-modality therefore 
qualitatively supports the collisionless DM hypothesis. 

The third category comprises the remaining three unrelaxed 
clusters (A 209, A 267 and A 1763) for which there is no com- 
pelling evidence in the current data for bi/tri-modality in the DM 
distribution. Nevertheless, several clues as to the true DM distri- 
butions are present in the data. For example, the strongly elliptical 
mass distribution of A 267 may indicate that the matter distribu- 
tion is more complex than a single elongated DM halo plus cluster 
galaxy population (Edge et al. 2003). Deep wide-field space-based 
imaging is needed to explore this possibility, specifically to search 
for evidence of other mass concentrations using weak gravitational 
lensing (Kneib et al. 2003). Turning to A 1763, galaxies appear to 
be falling in to this cluster along a ~2Mpc long filament (Vallee 
& Bridle 1982; Bardeau et al. 2004, in prep.). This cluster may 
therefore be experiencing a merger in the plane of the sky, which is 
thus poorly sampled by the small field-of-view of the WFPC2 data 
used in this study. An alternative interpretation of these three clus- 
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ters is that sufficient time has elapsed since the presumed merger 
event for the DM distribution to relax and thus not present a bi/tri- 
modal structure at the epoch of observation. However, given the 
strong dynamical disturbance in the X-ray flux maps, we consider 
this unlikely. 

Ultimately cross-correlation of mass and X-ray maps of these 
and similarly selected samples of clusters at higher redshifts will 
help to constrain the relevant relaxation timescales and processes. 
Critical next steps towards that goal are to obtain wider-field 
space-based imaging to overcome uncertainties arising from the 
current tiny pencil-beam WFPC2 observations of the cluster cores, 
plus enlargement of the sample to overcome the small number 
statistics of comparing sub-samples of ~3 systems. Uniformly 
deep X-ray data would also enable tight constraints on the tem- 
perature structure of the unrelaxed clusters to be achieved. Such 
temperature maps would help to more clearly delineate the struc- 
ture of the ICM in the unrelaxed clusters. 

From a theoretical perspective, numerical simulations suggest 
that both gas dynamics and substructure in the DM distribution may 
persist for as long as ~5 Gyr following a merger event (Schindler 
& Miiller 1993; Nakamura, Hattori Sl Mineshige 1995; Roettiger, 
Loken & Bums 1997; Tormen, Diaferio & Syer 1998; de Lucia 
et al. 2004). This suggests that despite the expected differences be- 
tween the physics of the ICM and the DM, the relaxation timescales 
for these two matter components may be comparable. The differ- 
ences in cluster X-ray and mass morphologies noted above imply 
that the X-ray and mass morphological evolution of clusters may 
follow different evolutionary paths, even if the overall relaxation 
timescales are indeed comparable. This idea is supported by the 
numerical simulations of Ricker & Sarazin (2001), who found that 
oscillations in the gravitational potential of a merging cluster (due 
to the dominant collisionless DM) sustain turbulence and thus non- 
relaxation of the ICM on timescales comparable with that required 
for the DM to achieve equilibrium. 

7.2 Cluster Scaling Relations 

Numerous observational and theoretical studies of cluster scaling 
relationships have addressed the slope, normalization and intrin- 
sic scatter of these relations. Structural segregation in the scaling 
relation planes has also been discussed in the context of cooling- 
flow and non-cooling-flow clusters in the luminosity-temperature 
plane (Fabian et al. 1994; Markevitch 1998; Allen & Fabian 1998; 
Arnaud & Evrard 1999). Adding gravitational lensing as a direct 
and clean probe of cluster mass to scaling relation studies parallels 
the importance of adding this diagnostic into the structural studies 
discussed above. Lensing-based mass estimates have been used to 
supplement X-ray cluster studies (e.g. Allen et al. 2003), however 
these studies have relied on weak-lensing data without an absolute 
mass normalization from strong-lensing constraints. The resulting 
large error bars therefore significantly degrade the advantage of 
using lensing as a probe of cluster mass. Small et al. (1997) at- 
tempted to include lensing in cluster scaling relation studies, using 
the weak-shear signal of optically selected clusters as a surrogate 
for mass to construct a shear-Lx relation. Hjorth et al. (1998) went 
a step further, using weak-lensing mass estimates to estimate the 
mass of eight clusters, and thus construct the first lensing-based 
mass-temperature relation. However, as these authors point out, 
their use of weak-lensing and heterogeneous selection function un- 
dermines the precision of their work. 

The important advantage of our study is that the lensing- 
based mass measurements are based on detailed strong-lensing 



mass models, the normalization of which are locked down by 
spectroscopically confirmed multiple-image systems and cross- 
calibration between strong-lensing and weak-lensing constrained 
clusters. Combining the lensing results with high quality X-ray 
spectro-imaging with Chandra places us in a hitherto unique po- 
sition to explore the mass-temperature plane. We therefore con- 
centrate our discussion of cluster scaling relations in this area. It 
is also important to note that the mass information extracted from 
the lensing mass maps is two-dimensional. In contrast, the infor- 
mation extracted from theoretical simulations is three-dimensional. 
Reliable calibrations between 2- and 3-dimensional cluster masses 
have not yet been achieved. We therefore concentrate on discussing 
the scatter in the mass-temperature plane and the related issue of 
structural segregation, i.e. the relative normalization of relaxed and 
unrelaxed clusters. 

7.2.1 Scatter 

Evrard, Metzler & Navarro (1996) predicted that solely a broad- 
beam measurement of the temperature of ICM in an individual clus- 
ter can be used to measure cluster masses to an rms precision of 
~10-20%. This is in stark contrast to the um—Q-Q scatter in mass 
that we detect in il6.2l An important clue as to the origin of this dif- 
ference is that the observational scatter appears to be dominated by 
the unrelaxed clusters (see Table|7}. However several other factors 
may also contribute to both the size of the observed scatter and the 
apparent discrepancy between observation and theory. 

We first consider the issue of aperture size. The observational 
mass measurements sample just the central _R^500kpc of each 
cluster; this aperture matches approximately the radius at which the 
cluster density is 2500 x the critical density, i.e. (Sc=2500 (Smith 
et al. 2003). The scatter may therefore be dominated by systemat- 
ics relevant only to the very central regions of the clusters. We use 
the ground-based weak-lensing analysis of the same clusters by 
Bardeau et al. (2004, in prep.) to make a preliminary estimate of 
how the scatter may reduce if the current analysis were extended to 
larger radii (5c— 500). The uncertainties on Bardeau et al.'s results 
are largely due to the poor spatial resolution of ground-based data 
relative to our HST data. However comparison of the two datasets 
suggests that one-third of the scatter may be due to the small aper- 
ture size employed in this study. This variation in scatter as a func- 
tion of overdensity is consistent with recent observational results 
in the X-ray pass-band (e.g. Ettori, De Grandi & Molendi 2002). 
Therefore, after taking account of aperture size, the scatter remains 
a factor of ~3 larger than the simulations. 

A further potentially important issue is that of selection ef- 
fects. Ricker & Sarazin's (2001; see also Ritchie Sl Thomas 2002) 
simulations suggest that cluster-cluster mergers boost cluster X- 
ray luminosities on timescales of a few Gyrs. This may lead our 
X-ray luminosity selected sample to be biased toward merging sys- 
tems. However the most luminous cluster in the sample is A 1835, 
a canonical relaxed cluster with a cool core, indeed a central excess 
of X-ray flux is a signature of the most relaxed clusters. Overall, 
the details of cluster selection are a complicated subject, however 
there is no strong evidence that the sample is biased toward un- 
relaxed systems. Nevertheless, a careful like-for-like comparison 
between synthetic and observational cluster selection functions is 
an important outstanding task. 

In summary, we find a factor of 3 more scatter in the mass- 
temperature relation than predicted. Further work is required to en- 
sure that measurement methods applied to synthetic and observed 
datasets are well matched, and to extend the space-based lensing 
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results to wider fields-of-view. However, even after this work has 
been completed, the disagreement between observation and theory 
may persist, which would be a signature of missing physics in the 
theoretical descriptions of cluster assembly and relaxation. 

7.2.2 Structural Segregation 

In il6.2l we identified that unrelaxed clusters are 40% hotter than 
relaxed clusters at 2.5-g significance. Clearly, the aperture size is- 
sue noted above also impacts on this result, and wider-field space- 
based weak-lensing observations of a statistically complete sample 
are critical to a thorough investigation of this uncertainty. Neverthe- 
less, the comparison with Bardeau et al.'s (2004, in prep.) ground- 
based weak-lensing results provides important reassurance that a 
substantial fraction of the 40% temperature offset is a genuine 
physical effect. 

Indeed, recent observational and theoretical work supports 
the idea that unrelaxed clusters are hotter than relaxed systems. 
Using spatially resolved spectroscopy with BeppoSAX, Ettori, 
De Grandi & Molendi (2002) identified the normalization of the 
mass-temperature relation of "non-cooling flow" clusters to be 
hotter than that of "cooling flow" clusters, however the large un- 
certainties limited the statistical significance of this result. On the 
theoretical side, Ricker & Sarazin's (2001) simulations of cluster- 
cluster mergers indicate that merger-induced boosts of up to a fac- 
tor of ten (the amplitude of the boost depends on the mass ratio 
between the merging clusters and the impact parameter of the colli- 
sion) in temperature can occur on short timescales (<lGyr) due to 
shock-heating of the gas in a major merger. This relatively brief lu- 
minosity boost suggests that not many clusters in our sample should 
have temperatures >3x higher than the mean relation. This is in- 
deed the case, with A 22 1 9 being the possible sole example of a 
cluster currently experiencing a temperature boost of this magni- 
tude. This cluster has previously been identified as having recently 
experienced a core-penetrating merger (e.g. Small et al. 1995; Gio- 
vannini, Tordi & Feretti 1999) 

Despite the short-lived extreme temperature boosts, merger 
remnants appear to asymptote to temperatures ~ 10-40% higher 
than the pre-merger configuration, the precise long-term boost 
again depending on the mass ratio and impact geometry. Assuming 
that elimination of aperture size and related issues ( i|7.2.I> reduces 
the temperature offset between relaxed and unrelaxed clusters to 
~20-30%, these theoretical results support the idea that the cluster 
mergers are responsible for the structural segregation of clusters in 
the mass-temperature plane. 

If empirically clusters are either relaxed hosts of cool cores, or 
unrelaxed (i.e. merging or merger remnant) systems without cool 
cores, then merger-boosts may be sufficient to explain the struc- 
tural segregation. However deep integrations with Chandra and 
XMM-Newton indicate that the picture may not be so clear-cut. For 
example some cool core clusters appear to be undergoing merger 
activity (e.g. Perseus - Churazov, et al. 2003), and some dynam- 
ically relaxed clusters do not host a cool core (e.g. A 963 in this 
work). The significance of merging cool core clusters and relaxed 
non-cool core clusters for the merger-boost interpretation of struc- 
tural segregation in the mass-temperature plane is unclear at this 
time. For example the mass ratio of the Perseus merger may be 
so large (i.e. the mass of the in-falling system so small relative 
to Perseus) as to not be relevant to the current discussion. How- 
ever it does suggest that alternative mechanisms such as cluster- 
to-cluster variations in the level of initial pre-heating may be an 
additional complication when interpreting the demographics of re- 



laxed/unrelaxed and cool core/non-cool core clusters (Babul, Mc- 
Carthy & Poole, 2003). 

7.3 Implications for Cluster Cosmology 

Massive galaxy clusters are one of a number of complementary 
probes of cosmological parameters. For example, many studies 
have used empirical determinations of cluster scaling relations to 
convert the observed X-ray luminosity and/or temperature func- 
tions into mass functions. The most massive clusters are rare ob- 
jects, and thus the constraints on the high-mass end of the mass 
function inferred from such experiments enables constraints on a 
combination of Qm and erg in principal to be achieved. Recent 
cluster-based estimates of erg have yielded discrepant results, with 
most estimates of erg falling in the range ~0.6-I (e.g. Eke et al. 
1996; Nevalainen, Markevitch & Forman 2000; Allen, Schmidt & 
Fabian 2001; Borgani et al. 2001; Pierpaoli, Scott & White 2001; 
Reiprich & Bohringer 2002; Viana et al. 2002). The critical step 
in these experiments is the conversion from observable (i.e. X-ray 
luminosity or temperature) to mass. Our results suggest that care 
must be taken to incorporate sufficient scatter in the observable- 
mass relationship. The asymmetric scatter of the mass-temperature 
relationship arising from structural segregation of clusters in this 
plane implies that such issues may be most acute when converting 
from cluster temperature to mass, especially if the cluster selection 
function is poorly characterized. Indeed, in a companion to this ar- 
ticle. Smith et al. (2003) showed that incomplete understanding of 
the cluster selection function can cause ~20% systematic uncer- 
tainties in as, favouring values to the lower end of the range found 
in other recent works. 

In a similar vein, we note that our results will likely impinge 
on cosmological results derived from SZE surveys (see Carlstrom, 
Holder & Reese 2002 for a recent review). Again, the key issue is 
the precision to which the observable-mass relationship is known; 
in this case the observable is the temperature of the ICM, as derived 
from the SZE signal. The issues for SZE surveys may be aggravated 
because the goal of measuring the dark energy equation-of-state 
parameter w, relies at least in part on measuring the evolution of 
the cluster population between two redshifts straddling the epoch 
at which the dark energy is thought to take over as the dominant 
factor in the expansion of the universe. This is in contrast to mea- 
suring a combination of erg and flu from studies of local clusters, 
i.e. just one redshift slice. Further detailed wide-field investigations 
of massive clusters at both low (^=0.2) and higher (z>0.6) red- 
shifts are therefore needed to quantify robustly the evolution of the 
dynamical maturity of clusters and the impact of that on cluster 
scaling relations. We suggest that, in the light of major imminent 
SZE cluster surveys, this is an urgent exercise. 



8 SUMMARY AND CONCLUSIONS 

We have undertaken a comprehensive study of the distribution of 
mass in ten X-ray luminous (Lx^8x 10*''ergs^^[0.1-2.4keV]) 
galaxy clusters at 2;=0.21±0.04. The cornerstone of our analysis 
is a suite of detailed gravitational lens models that describe the 
distribution of total mass in the cluster cores. These models are 
constrained by the gravitational lensing signal detected in high- 
resolution HSTfWFPCl imaging of the clusters, including numer- 
ous multiply-imaged and weakly-sheared background galaxies. 
Analysis of archival Chandra observations complements the lens- 
ing analysis and enables us to relate the total mass and structure 
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of the clusters to the thermodynamics of the intra-cluster medium. 
We re-cap the key results. 

(i) Five of the ten clusters contain spectroscopically confirmed 
strong gravitational lensing, i.e. multiply-imaged background 
galaxies. These five clusters comprise: A 68 (Smith et al. 
2002b; Ell, A 383 (Smith et al. 2001; Sand et al. 2004), 
A963 (EUis et al. 1991), A2218 (Pello et al. 1992; Ebbels 
et al. 1998; EUis et al. 2002) and A 2219 (EIJ. 

(ii) Of the remaining five clusters, two contain relatively un- 
ambiguous examples of strong lensing for which spectro- 
scopic redshifts are not yet available (A 267 and A 1835). The 
other three clusters, A 209, A 773 and A 1763 do not contain 
any obviously multiply-imaged galaxies, however the optical 
richness and massive nature of A 773 imply that this cluster 
may well contain strong lensing that has yet to be uncovered. 

(iii) Based on our search for strong-lensing in these clusters down 
to a surface brightness limit of /i702— 25 mag arcsec"^, we 
therefore put a firm lower limit on the fraction of the cluster 
sample that have a central projected mass density in excess of 
the critical density required for gravitational lensing of 50%. 
Including A 267 and A 1835 increases this limit to 70%. 

(iv) We use the strong- and weak-lensing signals to constrain 
parametrized models of the cluster potential wells, and from 
these models compute maps of the total projected mass in the 
cluster cores. Spatial analysis of these maps reveals that four 
of the clusters form a homogeneous sub-sample with very 
high central mass fractions (Mccn/A'/tot>0.95). The remain- 
ing six are strongly heterogeneous, with central mass frac- 
tions in the range 0.4^Mcon/Aftot5;0.9. The central mass 
fraction of A/ccn/.Aftot— 0.95 that divides these two popula- 
tions corresponds to a if-band central luminosity fraction of 

-^K,BCG/iK,tot'~^0.5. 

(v) All of the six low central mass fraction clusters have an ir- 
regular, but not obviously bi/tri-modal X-ray morphology. 
Four of the six are constrained by the current lensing data to 
have a bi/tri-modal mass morphology (A 68, A773, A2218, 
A 2219). The other two (A 209 and A 1763) may be merg- 
ing in the plane of the sky and thus any multi-modality in 
their mass distributions is not well-sampled by our WFPC2 
pencil-beam survey of the cluster cores. 

(vi) Three of the four high central mass fraction clusters also have 
relaxed X-ray morphologies. The remaining cluster (A 267) 
has a disturbed X-ray morphology, with a ~90 kpc offset be- 
tween its centers of X-ray emission and mass. The distribu- 
tion of mass in this cluster may therefore be more complex 
than the single dark matter halo (plus cluster galaxies) that 
the current data are able to constrain. 

(vii) Combining all of the information available to us, we de- 
fine "relaxed" clusters to be those which appear relaxed 
in all available diagnostics, with the exception that we 
do not require a cool core. Quantitatively relaxed clus- 
ters therefore have a single cluster-scale DM halo in 
their lens model (A^dm = 1), a high central mass fraction 
(Mccn/Aftot^O.95) and central K-hanA luminosity fraction 
(iif,BCG/iA',tot^0.5), no evidence for an offset between 
the X-ray emission and the center of mass Arpcak<4kpc) 
and the X-ray morphology is either circular, or mildly ellip- 
tical. The unrelaxed clusters do not meet at least one of these 
criteria. 



(viii) Applying these criteria to the cluster sample, we conclude that 
seven of the ten clusters are dynamically immature, i.e. unre- 
laxed (A 68, A209, A267, A773, A 1763, A2218, A2219) 
and three are relaxed (A 383, A 963, A 1835); thus, formally 
70±20% of X-ray luminous cluster cores at z=Q.2 are unre- 
laxed. 

(ix) We detect a factor of three more scatter in the mass- 
temperature plane than predicted by Evrard, Metzler & 
Navarro (1996), implying that great care should be exercised 
when using such relations to convert cluster temperature func- 
tions to mass functions in pursuit of cosmological parameters. 
We also consider a number of key uncertainties that may ar- 
tificially inflate our estimate of the scatter. This exercise sug- 
gests that approximately one third of the scatter detected in 
this study may be due to issues related to the small field-of- 
view of our WFPC2 observations. 

(x) The scatter in the mass-temperature plane is asymmetric, pre- 
senting evidence of structural segregation. The normalization 
of the mass-temperature relation for unrelaxed (dynamically 
immature) clusters is 40% hotter than for relaxed clusters at 
2.5-(T significance. This result is consistent with recent sim- 
ulations of cluster-cluster mergers (Ricker & Sarazin 2001; 
Randall, Sarazin & Ricker 2002), implying that merger in- 
duced temperature boosts may be the dominant factor behind 
the hotter normalization of unrelaxed systems. 



In summary, this study is the first of its kind, exploiting de- 
tailed strong-lensing constraints on the distribution of mass in 
X-ray luminous cluster cores, complemented by X-ray spectro- 
imaging with Chandra. The high frequency of dynamical immatu- 
rity, coupled with the structural segregation of the clusters in the 
mass-temperature plane have profound implications for our under- 
standing of how clusters form and evolve. Perhaps of greatest im- 
portance at this time is the implications of these results for using 
clusters to constrain the cosmological parameters, Q.m, o"8 and the 
dark energy equation-of~state parameter w. In a companion paper 
we demonstrate that inadequate knowledge of the cluster selection 
function can lead to 20% systematic errors in cluster-based mea- 
surements of (Tg. Turning to w, forthcoming Sunyaev-Zeldovich 
Effect experiments designed to detect and measure the mass of 
clusters out to high redshifts, using mass-temperature scaling rela- 
tions may be compromised by unidentified and/or poorly calibrated 
astrophysical systematic uncertainties (see also Majumdar & Mohr 
2003). 

Our future program will build on these results in three ways. 
First, we aim to overcome the principal uncertainties in the current 
work: small number statistics and tiny field-of-view. Wide-field 
space-based imaging of a statistically complete sample of clus- 
ters would be essential to achieve this goal. Second, detailed com- 
parison of selection effects and measurement techniques between 
theoretical and observational studies will enable more detailed and 
rigorous comparison between observational and synthetic datasets. 
Finally, we are gathering HSTiACS imaging of an identically se- 
lected sample of clusters at 2:~0.55 drawn from the MACS sample 
(Ebeling et al. 2001b). We will combine these data with observa- 
tions in the X-ray pass-band and compare the results to those found 
here. The ~3Gyr difference in look-back-time between z—0.2 
and 2=0.55 will enable us to search for evolutionary trends in the 
most massive clusters. 
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APPENDIX A: GRAVITATIONAL LENS MODELLING - METHOD 

This appendix describes relevant details of how the LENSTOOL ray-tracing code is used to construct robust models of the distribution of 
mass in galaxy clusters using both strong- and weak-lensing constraints. 



Al Mathematical Overview 

Consider a single source at zs that appears to an observer under the action of a gravitational lens at zl as A'^ distinct images at positions 
Ui (l^i^N). We describe the source with v free parameters, Ilj (l^j^i/), for example: the position of the center of the source, the 
ellipticity, the orientation, the surface brightness. We write the transformation equations in the following form: 

nf=/,(nj„<^(^iO) (i^i^N) (i^jfiu) (Al) 

where fj are functions that depend on the parameters that describe the observed images and the gravitational potential of the lens. The source 
parameters , Ylf , and the lens potential, ipiui), are the unknowns in these equations. We use the image parameters , Ilji (i.e. the observables), 
to constrain both the source parameters and the lens potential. If we are able to recover v parameters for each image, then we have u{N~l) 
constraints on our lens model. Generalizing this to n sets of multiple images of sources at redshifts zsi, each multiple-image system being 
characterized by {Pi,Ni), then the total number of constraints ric is given by: 

nc=^h(Af>-l)-6,] (A2) 

i 

where 6^=0 if 23^ is known and £^=1 if zsi is not known (Kneib et al. 1993). Strictly, Eguation lAll onlv applies in the idealized case of all 
multiple-images being resolved, and none of the images being merging pairs. Clearly higher resolution imaging will increase Uc- 

We describe each observed gravitational image with the following parameters: H' = {u' , ,f^), where is the position of the image, 
is the observed flux and r^^r^e^*" is the complex deformation of the image, describing its ellipticity (r) and orientation {6). We use 
these quantities and their counterparts in the source-plane to write down the transformation equations: 

Position : —u' ip{u' ) 

Flux: S'^=|det JlS-f (A3) 
Shape : sgn(det J) f « = - fpot [5' -t' ^{f g;^,)] 

where the first equation is simply the lens equation, J is the Jacobian matrix of the tensing transformation, S={1+t'^Y^'^ , 9 = 1 /(l^'*^)' 
the subscript "pot" denotes quantities applicable to a circular source (see Kneib et al. (1996) and references therein for a detailed derivation 
of the shape transformation equation), S takes the real part of dot-product between g and g^^t' ^nd gf is the complex conjugate of g. 



A2 Parametrization of the Lens Plane Mass Distribution 

Each mass component is parametrized as a smoothly truncated pseudo-isothermal elliptical mass distribution (PIEMD - Kassiola &. Kovner 
1993; Kneib et al. 1996). This functional form is physically well motivated (it avoids the central singularity and infinite spatial extent of 
singular isothermal models) and can describe mass distributions of arbitrarily large ellipticities. Each PIEMD mass component is parametrized 
by its position (Xc, J/c), ellipticity (e), orientation {9), core radius (rcorc), cut-off radius (rcut) and central velocity dispersion (cto), and the 
projected mass density, S is given by: 

v^/ N '^o^ ''cut (1 1 \ /A .IX 



2G Tent - W V ('■coro^ +p2)l/2 (r.^t^ +p2)l/2 

where p'^ = [{x— Xc) / {l+eYi^ + [{y— j/c)/(l— e)]^ and the ellipticity of the lens is defined as e—(a—b) / (a+6). The geometrical parameters 
{Xc, i/c, e, Q) of each mass component are matched to the observed light distribution of the related cluster galaxy. The dynamical parameters 
(tco-tc, ''cut, cto) of the "major mass components" (i.e. cluster-scale mass components and selected bright cluster galaxies, including each 
central galaxy) are kept as free parameters. To minimize the number of model parameters, the dynamical parameters of the remaining mass 
components are scaled with the luminosity of their associated galaxy following Brainerd, Blandford & Small (1996): 

J-corc = ^corc* (J^/-'^*)^''^; ^cut = ^cut* (-L/i*)^''^; G o = (To* {L / L* )^ ' (A5) 

We also scale the mass of individual galaxies with their luminosity, using: 

M={-K/G){oo*f rent* (L/L*) (A6) 

These scaling laws are physically well motivated and conserve the mass-to-light ratio of the galaxies in a manner analogous to the observed 
Faber-Jackson and Tully-Fisher scaling relations for spiral and elliptical galaxies respectively. 

A3 Model Optimization 

We construct a x'^-estimator to quantify how well each trial lens model fits the observational data: 

X Xpos^"Xshapc^"Xfiux^-Xcrit~^"Xwcak 

(A7) 
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fiy Q o Q O 

The first three terms compare the source-plane properties computed for each observed image (xf , y° , t° , Of , - as defined in equation 
EU- We define Xpos> Xshapo and xlux as follows: 

2 ixf-xf_,f+{yf-yf_^f 

^pos 

2 (rf cos (2ef ) -rf_ 1 cos (2ef_ i ) f+{Tfsin{2ef ) -rf_ ^ sm(2gf_ ^ ) ) ^ 

Xshape= 2^ (A9) 



shape 



xiux= 2^ 2 '"" (Aio) 



i — 1 

where (Tpos, f-^jj^^p^ and a^^^ are the accuracies with which we can measure the position, shape and flux of galaxies in our HST data. 

The fourth term in eauation lA7l compares how well the symmetry breaks in the observed gravitational images (i.e. locations of critical 
lines) are reproduced by the lens model. We define {x°t", y°t^) and (2;™°'', y^°'^) as the observed and model critical line positions respectively 
and construct Xcrit > where Aa^crit and Aycrit are the uncertainties in the position of the symmetry break. 



Xcrit = ^^'^^ r 2 \ ^"2 ' (All) 

Finally, we construct Xwcak a similar manner to xfimpo' the differences being that we sum over the F faint galaxy images detected in 
the cluster field, cr weak is the width of the distribution of galaxy shapes from surveys of field galaxies (e.g. Ebbels 1998) and we compare the 
image-plane galaxy shapes with that induced by the trial mass distribution at the faint galaxy image on a circular source. 



(x^^'-xT^f+iy^^^-yT"'^ 



Xwcak= > ^ 2 (^12) 



F- 



^ (r/cos(2ef)-rp^otCos(26l^ot))'+(^/sin(26'f)-rp'otsm(2e^oO) 



i = l 



The estimator is minimized by varying the model parameters to obtain an acceptable (x^~l) fit to the observational constraints. This 
is an iterative process, which we begin by restricting our attention to the least ambiguous model constraints (i.e. the confirmed multiple-image 
systems) and the relevant free parameters. For example, in a typical cluster lens there will be one spectroscopically-confirmed multiple-image 
system and a few other candidate multiples. The model fitting process therefore begins with using the spectroscopic multiple to constrain the 
dynamical parameters of the main cluster-scale mass component. 
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